FIGURE 1 



GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGCAGGT C T GAAGAC TAACAT T T TGTGAAGT TGTAAAACAGAAAACC T GT TAGAAATGT GGT GGT 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
TCATACAT TACT GCAGT AACAC T CCACCAT ATAGACCCGGCT T TACC T T AT ATCAGTGACACT GG 
TACAGTAGC T CCAGAAAAAT GC T TAT T T GGGGCAATGC TAAAT AT TGC GGCAGT T T TAT GCAT TG 
C TACCAT T T AT GT TCGTTAT AAGCAAGT TCATGC TC T GAGT CC T GAAGAGAACGT TATCATCAAA 
TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
CCAGT^AAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 
TATATATGTTTGTTCAGACCATCCTTTCCTACCAAATGCAGCCCAAAATCCATGGCAAACAAGTC 
TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 

AT CAGTT T T GCACAGT GGCAAT T T TGGGAC TGAT TTAGAACAGAAAC T CCAT TGGAACCCCGAGG 
ACAAAGGTTATGTGCTTCACATGATCACTACTGCAGCAGAA.TGGTCTATGTCATTTTCCTTCTTT 
GGTTTTTTCCTGACTTACATTCGTGATTTTCAGAAAATTTCTTTACGGGTGGAAGCCAATTTACA 
TGGATTAACCCTCTATGACACTGCACCTTGCCCTATTAACAATGAACGAACACGGCTACTTTCCA 
GAGATATTTGATGAAAGGATAAAATATTTCTGTAATGATTATGATTCTCAGGGATTGGGGAAAGG 
TTCACAGAAGTTGCTTATTCTTCTCTGAAATTTTCAACCACTTAATCAAGGCTGACAGTAACACT 
GAT GAAT G C T GAT AAT CAG GAAAC AT G AAA.GAA.GCCAT T T G ATAGAT T AT T C TAAAGGAT AT CAT 
C AAG AAG AC TAT T AAAAAC AC C T AT GC C TAT AC T T T T T TAT C T C AGAAAAT AAA.G T CAAAA.G AC T 
ATG 



FIGURE 2 



<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766, pi: 8.39, NX(S/T): 0 

MWWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNIAAV 
LCIATIYTOYKQVHALSPEENVIIKLNKAGLVLGILSCLGLSIVANFQKTTLFAAHVSGAVLTFG 
MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHW 
NPEDKGYVLHMITTAAEWSMSFSFFGFFLTYIRDFQKISLRVEANLHGLTLYDTAPCPINNERTR 

LLSRDI 




Important features: 

Type II transmembrane domain: 

amino acids 13-33 



Other Transmembrane domains: 

amino acids 54-73, 94-113, 160-180, 122-141 




N-myristoylation sites . 

amino acids 57-63, 95-101, 99-105, 124-130, 183-189 



ru 
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FIGURE 3 



CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 
GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
CTTACACTTCGCCATGAGTTTCCTCATCGACTCCAGCATCATGATTACCTCCCAGATACTATTTTTTG 
GATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCAGCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGTATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGG 
TTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATGCGACTACTGCATAAACAACGA 
CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 
CATTCTCAGCCCAAAACAT GGGATCTTAT CCATAGAACAGCTCATCAGCCGGGTTGGTGTGATT GGAG 
•PI TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 
0! CTCAGGAATGTGACTGACACGGATATTCTAGCCCTGGAACGGCGACTGCTGCAAACCATGGATATGAT 
bl CATAAGCAAAAAGAAAAGGATGGCAATGGCACGGAGAACAATGTTCCAGAAGGGGGAAGTGCATAACA 

AACCATCAGGTTTCTGGGGAATGATAAAAAGTGTTACCACTTCAGCATCAGGAAGTGAAAATCTTACT 
CTTATTCAACAGGAAGTGGATGCTTTGGAAGAATTAAGCAGGCAGCTTTTTCTGGAAACAGCTGATCT 
ATAT GCTACCAAGGAGAGAAT AGAATACT CCAAAAC CTTCAAGGGGAAATATT TT AATT T T CT TGGTT 



SI 

W 



tot 

m 



m ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 
Iff GGGAAAACGGATCCTGTCACAAGAGGCATTGAGATCACTGTGAATTATCTGGGAATCCAATTTGATGT 

C| GAAGT TT T GGT CCCAACAC AT TT CCTT CATTCT TGT T GGAATAATCATCGTCACAT CCAT CAGAGGAT 

TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 
CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCT.TT 
AGAATACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTTTG 
ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 
GAGAAGCAAAT GGCACCT TGAACTT AAGCCTAC TACAGACTGT TAGAGGCCAGTGGT TT CAAAAT TTA 
GATATAAGAGGGGGGAAAAATGGAACCAGGGCCTGACATTTTATAAACAAACAAAATGCTATGGTAGC 
ATTTTTCACCTTCATAGCATACTCCTTCCCCGTCAGGTGATACTATGACCATGAGTAGCATCAGCCAG 
AACAT GAGAGGGAGAAC TAACTCAAGACAATACTCAGCAGAGAGCAT CCCGTGTGGATAT GAGGCTGG 
T GTAGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAATACACTGGAAC T CTGGGGCAAGACATGT 
CTATGGTAGCTGAGCCAAACACGTAGGATTTCCGTTTTAAGGTTCACATGGAAAAGGTTATAGCTTTG 
CCTTGAGATTGACTCATTAAAATCAGAGACTGTAACAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCG 
ACTCTAGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 
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FIGURE A 

MS FL IDSS IMI TSQI LFFGFGWLFFMRQLFKDYE IRQYWQVI FSVT FAFSCTMFEL 1 1 FEILGV 
LNSSSRYFHWKMNLCVILLILVFl^ 

FPILSPKHGILSIEQLISRVGVIGVTI^LLSGFGAVNCPYTYMSYFLRNVTDTDILALERRLLQ 
TMDMIISKKKRMAMZ\RRTMFQKGEVHNKPSGFWGMIKSVTTSASGSENLTLIQQEVDALEELSRQ 
LFLETADLYATKERIEYSKTFKGKYFNFLGYFFS IYCVWKI FMATINIVFDRVGKTDPVTRGIEI 
TWYLGIQFDVKFWSQHISFILVGIIIVTSIRGLLITLTKFFYAISSSKSSNVIVLLLAQIMGMY 
FVSSVLLIRMSMPLEYRT 1 1 TEVLGELQFNFYHRWFDVT FLVSALS S ILFLYLAHKQAPEKQMAP 

Important features : 
Signal peptide: 

amino acids 1-23 

Potential transmembrane domains: 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobalamin-binding proteins 

amino acids 151-160 



APPJD=1 0063732 



Page 158 



FIGURE 5 

AGCAGGGAAATCCGGATGTCTCGGTTATGAAGTGGAGCAGTGAGTGTGAGCCTCAACATAGTTCC 
AGAACTCTCCATCCGGACTAGTTATTGAGCATCTGCCTCTCATATCACCAGTGGCCATCTGAGGT 
GTTTCCCTGGCTCTGAAGGGGTAGGCACGATGGCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
GTCAT G CAGAAT TAT GGGGATCACC C TT GT GAGCAAAAAGGCGAACCAGCAGC T GAAT T TCACAG 
AAGCTAAGGAGGCCTGTAGGCTGCTGGGACTAAGTTTGGCCGGCAAGGACCAAGTTGAAACAGCC 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAGTTTGCAGCCTATTGTTACAACTCATCTGATACTTGGACTAACTCGTGCATTCCAGAA 
AT TAT CAC CAC CAAAGATCCCATAT T CAACAC T C AAACT GCAACACAAACAACAGAAT T TAT T GT 
CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 
CTGC TCCAGCTTCCACTTCTAT TCCACGGAGAAAAAAATT GATT TGTGTCACAGAAGT TTT TATG 
GAAAC T AGCACCAT GTCT ACAGAAAC TGAACCAT T T GT T GAAAATAAAGCAGCAT TCAAGAAT GA 
AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 
CAGCTGGTCTTGGATTTTGCTATGTCAAAAGGTATGTGAAGGCCTTCCCTTTTACAAACAAGAAT 
CAGCAGAAGGAAATGATCGAAACCAAAGTAGTAAAGGAGGAGAAGGCCAATGATAGCAACCCTAA 
T GAGGAAT CAAAGAAAACT GATAAAAAC CCAGAAGAGTCCAAGAGTCCAAGCAAAAC TACCGTGC 
GAT GC CT GGAAGC T GAAGT T TAGA TGAGACAGAAAT GAGGAGACACACCT GAGGC TGGT TT CT T T 
CATGCTCCTTACCGTGCCCCAGCTGGGGAAATCAAAAGGGCCAAAGAACCAAAGAAGAAAGTCCA 
CCCTTGGTTCCTAACTGGAATCAGCTCAGGACTGCCATTGGACTATGGAGTGCACCAAAGAGAAT 
GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
TTCTAGCCTGGCTATGTCCTAATAATATCCCACTGGGAGAAAGGAGTTTTGCAAAGTGCAAGGAC 
CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 
AAAGCCAAGGAGTCACTGAGACCAAGGCTTTCTCTACTGATTCCGCAGCTCAGACCCTTTCTTCA 
GCTCTGAAAGAGAAACACGTATCCCACCTGACATGTCCTTCTGAGCCCGGTAAGAGCAAAAGAAT 
GGCAGAAAAGT TT AGCCCC T GAAAGCCAT GGAGAT T CTCATAAC T TGAGACC TAAT CT CTGTAAA 
GC TAAAATAAAGAAATAGAACAAGGC T GAGGATACGACAGT ACAC TGT CAGCAGGGAC T GT AAAC 
ACAGACAGGGT CAAAGT G T T T TC TCTGAACACAT TGAG TT GGAAT CAC TGTT TAGAACACACACA 
C T TACT T TTT C TGGTC T CT ACCACT GCT GATAT T TT CT CT AGGAAATAT AC TTT TACAAGTAACA 
AAAATAAAAACT CT TAT AAATT TC TAT TTT TATC TGAGTT ACAGAAAT GAT TAC TAAGGAAGATT 
ACTCAGTAATTTGTTTAAAAAGTAATAAAATTCAACAAACATTTGCTGAATAGCTACTATATGTC 
AAGTGCT GT GCAAGGT AT TACAC T C T GTAAT TGAATAT TAT T CC TCAAAAAAT TGCACATAGT AG 
AACGCTATCTGGGAAGCTATTTTTTTCAGTTTTGATATTTCTAGCTTATCTACTTCCAAACTAAT 
TTT TAT T T T T GC T GAGAC TAAT C T T AT T CAT T T TC TC TAAT AT G GCAAC CAT T AT AAC C T TAAT T 
TATTATTAACATACC TAAGAAGTACAT TGTTACC TCTATATACCAAAGCACATT TTAAAAGTGCC 
ATTAACAAATGTATCACTAGCCCTCCTTTTTCCAACAAGAAGGGACTGAGAGATGCAGAAATATT 
T GTGACAAAAAAT TAAAGCAT T T AGAAAACT T 



in; I 



FIGURE 6 

MARCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEACRLLG 
L SLAGKDQVE TALKAS FET CS YGWVGDG FWI S RI S PNPKCGKNGVGVL I WKVP VSRQFAAYCYN 
SSDTWTNSCIPEIITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPAPTTTPPAPASTSIPR 
RKKLICVTEVFMETSTMSTETEPFVENKAAFKNEAAGFGGVPTALLVLALLFFGAAAGLGFCWK 
RYVKAFPFTNKNQQKEMIETKWKEEKANTDSNPNEESKKTDKNPEESKSPSKTTVRCLEAEV . 

Signal sequence: 



y, amino acids 1-16 

hzi Transmembrane domain: 



amino acids 235-254 



N-glycosylation site . 



W 
%! . 

W' 

f£j amino acids" 53-57, 130-134, 289-293 

. 8 ..... . .-. 

'J?| Casein kinase II phosphorylation site. 

amino acids 145-149, 214-218 



Tyrosine kinase phosphorylation site. 

amino acids 79-88 

N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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FIGURE 7 

CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC 
GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 
CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCT.TGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 
CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 
GCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGCGAGGTTGAGGAACTGAT 
GGAGGACACGCAGCACAAAT TGCGCAGCGCGGT GGAAGAGATGGAGGCAGAAGAAGC T GCT GCTA 
AAGC AT CAT C AGAAG T GAAC C T GGC AAAC T T AC C T C C C AGC T AT CAC AAT G AGACC AAC AC AG AC 
ACGAAGGT TGGAAATAATACCATCCATGTGCACCGAGAAAT TCACAAGATAACCAACAACCAGAC 
T GGACAAAT GGT CT T TT CAGAGACAGT T AT CACATC TGTGGGAGACGAAGAAGGCAGAAGGAGCC 
ACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCCAGCTTCCAGTAC 
ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 
GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 
ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 
ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 
CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 
CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 
GAGAT CC T GCT GCCCAGAGAGG TCCCC GATGAG TATGAAGT TGGCAGCT T CATGGAGGAGG T GCG 
CCAGGAGCTGGAGGACCTGGAGAGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTG 
CCGCCGCTGCACTGCTGGGAGGGGAAGAGATTTAGATCTGGACCAGGCTGTGGGTAGATGTGCAA 
TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 
CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 
Of CCCCCAGGCTGTTCTCCAGGCTTCAGAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 
pi GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG 
2 TTTGTTCTACATGGCTTTGATAATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTC 

TGAT T GGT T T TGGGGAAAT GTGGAGAAGAGTGCCC T GCT T T G CAAACAT CAACCTGGCAAAAATG 
CAACAAAT GAAT T T T CCACGCAG T TC T T T C CAT GGGCATAGGTAAGCT G TGCCTT CAGC T GT T GC 
ft! AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 

TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 
AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 
CAAGTCACACAGCTAGTGAAGACCAGAGCAGTTTCATCTGGTTGTGACTCTAAGCTCAGTGCTCT 
CTCCACTACCCCACACCAGCCTTGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGAT 
T T T TC TT GAGGCATGCACAT CT GGAAT TAAGGTCAAAC T AAT T C T CACAT CCC TC TAAAAGT AAA 
CTACTGTTAGGAACAGCAGTGTTCTCACAGTGTGGGGCAGCCGTCCTTCTAATGAAGACAATGAT 
ATTGACACTGTCCCTCTTTGGCAGTTGCATTAGTAACTTTGAAAGGTATATGACTGAGCGTAGCA 
TACAGGT T AACC T GCAGAAACAG TACT T AGGTAATTG TAGGGCGAGGAT TATAAATGAAAT T T GC 
AAAATCAC T TAGCAGCAAC T GAAGACAAT TATCAACCACGT GGAGAAAATCAAACCGAGCAGGGC 
TGTGTGAAACATGGTTGTAATATGCGACTGCGAACACTGAACTCTACGCCACTCCACAAATGATG 
TTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTGCA 
CATGATTGTATAAGCATGCTTTCTTTGAGTTTTAAATTATGTATAAACATAAGTTGCATTTAGAA 
ATCAAGCATAAAT CAC T TCAACTGCAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 8 

MQRLGATLLCLLLAAAVPTAPAPAPTATSAPWPGPALSYPQEEATLNEMFREVEELMEDTQHKL 
RSAVEEME AEEAAAKAS SE WLANLP PS YHNE TNT DTKVGNNT I HVHRE I HKI TNNQTGQMVFSE 
TVITSVGDEEGRRSHECIIDEDCGPSMYCQFASFQYTCQPCRGQRMLCTRDSECCGDQLCVWGHC 
TKMATRGSNGT I CDNQRDCQPGLCCAFQRGLLFPVC T PLPVEGELCHDPASRLLDL I TWELEPDG 
ALDRCPCASGLLCQPHSHSLVYVCKPTFVGSRDQDGEILLPREVPDEYEVGSFMEEVRQELEDLE 

RSLTEEMALGE PAAAAAALLGGEE I 
Signal sequence: 



CfJ amino acids 1-19 



N-glycosylation site. 

amino acids 96-100, 106-110, 121-125, 204-208 
Casein kinase II phosphorylation site. 

amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 
327-331 



p:| N-myristoylation site. 

W amino acids 202-208, 217-223 



Amidation site. 

amino acids 140-144 
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FIGURE Q 



CGGACGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC 
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCGTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
GAGTATATTGACCACAAGCTCAGAGGAGTGCAGAACCCTGTAGCCCGCTGCATCATGTGCTGTTT 
CAAGTGCT GCCTCT GGTGTCTGGAAAAAT T TATCAAGTTCCTAAACCGCAATGCATACATCATGA 
TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 
ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 



ffl GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 



ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 
GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 



pj GGAAGACCTGGAGCGGAACAACGGCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA 

g " AGAT TCTGGGCAAGAAGAACGAGGCGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTCCGG 

Q CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 

U?. CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 

SI 



TTGAGAGGCTGAGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTG 
AAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCA 
GCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGA 
GATCGCGCCACTGCACTCCAACCTGGGTGACAGACTCTGTCTCCAAAACAAAACAAACAAACAAA 
AAGAT TT TAT TAAAGATAT T T TGTTAACT C 
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FIGURE 10 



RTRGRTRGGCEKVPINTSCNPTAHLVNSSCPGLMCVFQGYSSKGLIQRSVFNLQIYGVLGLFWTL 
NWVLALGQCVLAGAFAS FYWAFHKPQDI PT FPL I SAFIRTLRYHTGSIAFGAL ILTLVQI ARVI L 
EYIDHKLRGVQNPVARCIMCCFKCCLWCLEKFIKFLNRNAYIMIAIYGKNFCVSAKNAFMLLMRN 
IVRVWLDKVTDLLLFFGKLLWGGVGVLSFFFFSGRIPGLGKDFKSPHLNYYWLPIMTSILGAY 
VIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK 

Important features: 
Transmembrane domains: 




N-glycosylation sites. 



amino acids 57-80 (type II), 110-126, 215-231, 254-274 



amino acids 16-20, 27-31, 289-293 



111 



Hypothetical YBR002c family proteins. 




Ammonium transporters proteins. 



amino acids 276-283 



amino acids 204-231 



N-myristoylation sites. 

amino acids 60-66, 78-84 



Amidation site. 



amino acids 306-310 
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FIGURE 11 

GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATGGGGGCCTGCCTGGGAGCCTGC 
TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 
CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 
TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 
GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 
CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 
GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATGCTGGTGGGCCTC 
ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 
CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT 
fe^ GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 
M\ TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA 

CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 
%l CCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG 
Ul TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAACCCCCATTTGCCAACCCAGCT 

m 

" H7 G GGC AAC GAGAC AGT T G T GGC AGG C C C C GAGG G CT ATGAGACC C AGT GG T G GGAT G CCC C GAG CAT T G 

|L TGGGCCTCATCATCTTCCTCCTGTGCACCCTGTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 
H AACAGCCTGATGCAGACCGAGGAGTGCCCACCTATGCTAGACGCCACACAGCAGCAGCAGCAGCAGGT 
m GGCAGGCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 
fi| ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG 
U ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 
f M CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGCTGAGGCAGCCTCA 

CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 
CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 
CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 
ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 
AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 
TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 
GACTTCGTGCCTTACTGAGTCTCTAAGACTTTTTCTAATAAACAAGCCAGTGCGTGTAAAAAAAA 
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FIGURE 12 

MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVESQL 
YKLP WVCEEGAGI P T VLQGH I DCGS LLGYRAVYRMC FAT AAFFFFFFT LLMLCVS S SRDPRAAIQ 
NGFWFFKFL I LVGLTVGAFYI PDGS FTNI WFYFGWGS FLFIL IQLVLLI DFAHSWNQRWLGKAE 
ECDSRAWYAGLFFFTLLFYLLS IAAVALMFMYYTEPSGCHEGKVFI SLNLT FCVCVS IAAVLPKV 
QDAQPNSGLLQASVITLYTMFVTWSALSSIPEQKCNPHLPTQLGNETWAGPEGYETQWWDAPSI 
VGLIIFLLCTLFISLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVAACEGRAFDNEQDGVTYSY 
SFFHFCLVLASLHVmTLTNWYKPGETRKMISTWTAVWVKICASWAGLLLYLWTLVAPLLLRNRD 

FS 

Signal sequence: 

amino acids 1-20 

Transmembrane domains: 

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
272-283, 324-340, 391-406, 428-444 
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FIGURE 13 

CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GGT T GGAAAAAGAC TC CT G TAACCC TCCTCCAGGATGAACCACCTGCCAGAAGACAT GGAGAACG 
CTCTCACCGGGAGCCAGAGCTCCCATGCTTCTCTGCGCAATATCCATTCCATCAACCCCACACAA 
CTCATGGCCAGGATTGAGTCCTATGAAGGAAGGGAAAAGAAAGGCATATCTGATGTCAGGAGGAC 
TTTCTGTTTGTTTGTCACCTTTGACCTCTTATTCGTAACATTACTGTGGATAATAGAGTTAAATG 
T GAATGGAGGC AT T GAGAACAC AT TAGAGAAGGAGGT GAT GCAGTATGACT AC TAT T C T T CAT AT 
TTTGATATATTTCTTCTGGCAGTTTTTCGATTTAAAGTGTTAATACTTGCATATGCTGTGTGCAG 
AC TGCGC C AT TGG T GGGCAAT AGCGT T GACAAC GGCAGT GACC AGT GCC T T T T T AC TAGCAAAAG 
TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
CTTGCCTGGATTGAGACGTGGTTCCTGGATTTCAAAGTGTTACCTCAAGAAGCAGAAGAAGAAAA 
CAGACTCCT GATAGT TCAGGAT GC T T CAGAGAGGGCAGCACT T ATACC T GGTGGT CT T T CT GATG 
GT CAGT TT TAT TCCCCTCC T GAATCC GAAGCAGGAT CT GAAGAAGCT GAAGAAAAACAGGACAG T 
GAGAAACC AC T T T T AGAAC T ATGAGT AC TACT T T TGT T AAAT GT GAAAAAC C C T C AC AGAAAGT C 
ATCGAGGCAAAAAGAGGCAGGCAGTGGAGTCTCCCTGTCGACAGTAAAGTTGAAATGGTGACGTC 
CACTGCTGGCTTTAT TGAACAGC TAATAAAGAT TTAT TTAT TGTAATACCTCACAAACGTT GTAC 
CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 
TCAGTGAGACT GAGCC T GATGTGT TAACAAATAGGT GAAGAAAGT C T T GT GC TGTATT CCTAATC 
AAAAGAC T TAAT AT AT T GAAG T AACACT T T T T TAG T AAGC AAGAT ACC T T T T TAT TT CAAT T CAC 
AGAATGGAATTTTTTTGTTTCATGTCTCAGATTTATTTTGTATTTCTTTTTTAACACTCTACATT 
TCCCTTGTTTTTTAACTCATGCACATGTGCTCTTTGTACAGTTTTAAAAAGTGTAATAAAATCTG 
ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
C T TGCAAAAGGGGAAGAAAGGAAT T GC GAAT ACATGT AAAATGTCACCAGACAT T TGTAT TAT TT 
TTATCATGAAATCATGTTTTTCTCTGATTGTTCTGAAATGTTCTAAATACTCTTATTTTGAATGC 
AC AAAAT G AC T T AAAC CAT T CAT AT CAT GTTTCCTTTGCGTT C AGC C AAT T T CAAT T AAAAT G AA 
C T AAAT T AAAAA 



FIGURE 14 



MNHLPEDMENALTGSQSSHASLRNIHSINPTQLMARIESYEGREKKGISDVRRTFCLFVTFDLLF 
VTLLW 1 1 ELNVNGG I ENTLEKEVMQ YDY YSS Y FD I FLLAV FRFKVL I LAYAVCRLRHWWAI ALT T 
AVTSAFLLAKVILSKLFSQGAFGYVLPIISFILAWIETWFLDFKVLPQEAEEENRLLIVQDASER 
AALIPGGLSDGQFYSPPESEAGSEEAEEKQDSEKPLLEL 

Important features of the protein: 
Signal peptide: 

amino acids 1-20 



o 
m 



Transmembrane domains: 



amino acids 54-72, 100-118, 130-144, 146-166 




N-myristoylation sites. 

amino acids 14-20, 78-84, 79-85, 202-208, 217-223 



APP ID=1 0063732 
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FIGURE 15 



ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 
CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTGCTCCAG 
GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGGGCTGCCCAT 
CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 
CGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCATCACCATGCTCGACGC 
AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 
TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 
CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 
CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 
AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 
CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 
M> GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 
C| TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 
fj CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 
3J TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC 
7 ! TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 
W TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 
%l CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 
lyil CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT 
'ff] TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 
M# CACTGCCCCACCGACTGTAGGGCCTG.TCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTCA 
^ ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 
£!! GGCCTGTACTGTGAGAGCCAGATGGGGCAGGGGACACGGCCCAGCCCTACACCAGTCACGCCGAG 
Iff GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 

AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 
GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 
W? GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 
C| GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC 
HI CAGGCCCGCGAGGGGAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 

GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 
ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 
CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 
ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCA 
GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 
ACACCACGTAAGTTCTCAGTCCCAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCT 
GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 
CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 
CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 
TCCAGGGGGACCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 
GGCTGTGTGACTCTAGTCTTGGCCCCAGGAAGCGAAGGAACAAAAGAAACTGGAAAGGAAGATGC 
T T T AGGAACAT GTT TTGCTTTTT TAAAATATAT ATAT T T ATAAGAGAT CC T T TCCCAT TT AT T C T 
GGGAAGATGTTT TTCAAACTCAGAGACAAGGACT TTGGTT TTTGTAAGACAAACGATGATATGAA 
GGCC T T T T GT AAGAAAAAAT AAAAGATGAAGT GT GAAA 



APP ID=1 0063732 
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FIGURE 16 

MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGI.T 
MLDAGS FAGLPGLQLLDLSQNQIAS LPSGVFQPLANLSNLDLTANRLHE I TNETFRGLRRLERLY 
LGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLIALEPGILDTANVE 
ALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 
AGLAALQELDVSNLSLQALPGDLSGLFPRLRLLAAARNPFNCVCPLSWFGPWVRESHVTLASPEE 
TRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPWREPTALSSSLAPTWLSPTAPATEAP 
SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 
VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 
TVTQLRPNAT YSVCVMPLGPGRVPEGEEACGEAHTP PAVHSNHAPVTQARE GNL PLLI APALAAV 



R LLAALAAVGAAYCTORGRAMAA^QDKGQVGPGAGPLELEGVKVPLEPGPKATEGGGEALPSGSE 
UK CEVPLMGFPGPGLQSPLHAKPYI " 



Important features: 
Signal peptide: 

n amino acids 1-23 

M Transmembrane domain: 

f 33 



amino acids 579-599 

EGF-like domain cysteine pattern signature. 
3 amino acids 430-442 

Leucine zipper pattern, 
amino acids 197-219, 269-291 
N-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites . 
amino acids 124-131, 337-345 
N-myristoylation sites . 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 



APP_ID=1 0063732 
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FIGURE 17 



GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTC ATGC G 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
ATGAAGAAGGCAGC CAGGAT GAAT CC T TAGAT TCCAAGAC TAC T TT GACAT CAGAT GAGT CAGTA 
AAGGAC CAT AC T ACTGCAGGCAGAGTAGT TGC T GGT CAAATAT T T C T T GAT TCAGAAGAAT CT GA 
AT T AGAAT CCTCTAT TCAAGAAGAGGAAGACAGCC T CAAGAGCCAAGAGGGGGAAAGTGT CACAG 
AAGATAT CAGC T T TC TAGAGTCTCCAAAT CCAGAAAACAAGGACTATGAAGAGCCAAAGAAAGTA 
CGGAAACCAGCTTTGACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCT 
TTTCCTAGATAAGGAGTATGATGAATGTACATCAGATGGGAGGGAAGATGGCAGACTGTGGTGTG 
C TACAACCTAT GAC TACAAAGCAGATGAAAAGTGGGGCT T T TGTGAAACT GAAGAAGAGGCT GC T 
AAGAGAC GGCAGAT GCAGGAAGCAGAAAT GAT G TATCAAACT GGAAT GAAAAT CC T TAAT GGAAG 



|4 



W P CAATAAGAAAAGCCAAAAAAGAGAAGCAT AT CGGTAT CT CCAAAAGGCAGCAAGC AT GAAC CAT A 

W 
"Hi 



CCAAAGC CCT GGAGAGAGT GTCATATGC T CT T T TAT T T GGT GAT TAC T TGCCACAGAAT AT CCAG 
GCAGCGAGAGAGATGTTTGAGAAGCTGACTGAGGAAGGCTCTCCCAAGGGACAGACTGCTCTTGG 
fy CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 
ft TTGGAGCTCTTGGGGGCAATCTAATAGCCCACATGGTTTTGGTAAGTAGACTTTAGTGGAAGGCT 
W AATAATATTAACATCAGAAGAATTTGTGGTTTATAGCGGCCACAACTTTTTCAGCTTTCATGATC 



SIM 



m 



CAGAT T T GCT T G TAT TAAGAC CAAATAT T CAGT T GAACT T CCT T CAAAT TC TT G T TAATGGATAT 
AACACAT GGAATCTACAT GT AAATGAAAGT T GGTGGAGTCC ACAAT T TT TCTT TAAAATGAT TAG 
TTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGCTCTTTTTAAATTTTCTCTGAGTTG 
GAAT T G T C AGAAT CAT T T T T TACAT TAGAT TAT CATAAT T T TAAAAAT T T T T C T T T AGT T T T T CA 
AAAT T T T G TAAATGGT GGC T AT AGAAAAACAACAT GAAAT AT T ATAC AAT AT T T T GCAAC AAT GC 
CCTAAGAATTGT TAAAAT TCATGGAGTTATTTGTGCA.GAATGAC TCCAGAGAGCTCTACTT TCTG 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGAT T AAC T CAT T T T T AATAAAAT T AT GT CT AAGAT T AAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



APPJD=1 0063732 
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FIGURE 18 



pi 

•Sip. ;,' 



MRVRIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRWAGQIFLDSEESEL 
ESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHGEPCHFPFLFLDK 

EYDECTSDGREDGRL WC AT T YD YKADE KWGF CE TEE E AAKRRQMQEAEMM Y Q TGMKI LNG S NKK S QKR 
EAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 
S S QAKALVY YT FGALGGNL IAHMVLVS RL 

Important features: 
Signal peptide: 

amino acids 1-21 



fs% N-glycosylation sites. 



amino acids 195-199, 217-221, 272-276 

Tyrosine kinase phosphorylation site. 

amino acids 220-228 



s'p N-myristoylation sites. 

ft amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 



Glycosaminoglycan attachment site. 

amino acids 267-271 

Microbodies C- terminal targeting signal. 

amino acids 299-303 

Type II fibronectin collagen-binding domain protein, 

amino acids 127-169 



Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 



APPJEN1 0063732 



Page 172 of 322 



FIGURE 1Q 



AAT TCAGAT T T TAAGCCC AT TC TGCAG T GGAAT T TCAT GAAC TAGCAAGAGGAC AC CAT C T T CT T 

GTATTATACAAGAAAGGAGTGTACCTATCACACACAGGGGGAAAAATGCT CT T T TGGGTGCTAGG 

CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAGACATCACTG 

ATAAGTACATT T T TATCACTGGATGTGACTCGGGC TT TGGAAACT TGGCAGCCAGAACTTT TGAT 

AAAAAGGGATTTCATGTAATCGCTGCCTGTCTGACTGAATCAGGATCAACAGCTTTAAAGGCAGA 

AACCTCAGAGAGACTTCGTACTGTGCTTCTGGATGTGACCGACCCAGAGAATGTCAAGAGGACTG 

CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 

CCCGGCGTGCTGGCTCCCACTGACTGGCTGACACTAGAGGACTACAGAGAACCTATTGAAGTGAA 

CCT G TTT GGAC TCAT CAG T GT GACAC T AAAT ATGC T T CCT T TGGTCAAGAAAGCT CAAGGGAGAG 

TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 

TATGCAGTGGAAGGTTTCAATGACAGCTTAAGACGGGACATGAAAGCTTTTGGTGTGCACGTCTC 

ATGCATTGAACCAGGATTGTTCAAAACAAACTTGGCAGATCCAGTAAAGGTAATTGAAAAAAAAC 

TCGCCAT T TGGGAGCAGCT GTCTCCAGACAT CAAACAACAATAT GGAGAAGGT TACATT GAAAAA 

AGTCTAGACAAACTGAAAGGCAATAAATCCTATGTGAACATGGACCTCTCTCCGGTGGTAGAGTG 

CATGGACCACGCTCTAACAAGTCTCTTCCCTAAGACTCATTATGCCGCTGGAAAAGATGCCAAAA 

TTTTCTGGATACCTCTGTCTCACATGCCAGCAGCTTTGCAAGACTTTTTATTGTTGAAACAGAAA 

GCAGAGCT GGC TAATCCCAAGGCAG T GTGACT CAGC TAAC CACAAAT GTCT CC T CCAGGCTATGA 

AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 

CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 

CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 

GTATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 

TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 

AT CATC TC T T AT CTAAAT AT TAAAAGAT AAGT CAACCCAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAA 



CI 

ffi 



fa. ? 



111 



m amino acids 39-42 



m 



FIGURE 20 

ML FWVLGLL I LCGFLWTRKGKLK IEDI T DKY I FI TGCDS G FGNLAART FDKKGFHVI AACLTE S G 
STALKAETSERLRTVLLDVTDPE3ST^RTAQWVKNQVGEKGLWGLINNAGVPGVLAPTDWLTLEDY 
REPIEVNLFGLISVTLNMLPLVKKAQGRVINVSSVGGRLAIVGGGYTPSKYAVEGFNDSLRRDMK 
AFGVHVS C I E PGL FKTNLADP VKVI EKKLAI WEQL S PD I KQQYGEGY I EKSLDKLKGNKS YVNMD 
LSPVVECMDHALTSLFPKTHYAAGKDAKIFWIPLSHMPAALQDFLLLKQKAELANPKAV 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 136-152 

N-glycosylation sites . 

amino acids 161-163, 187-190 and 253-256 
Glycosaminoglycan attachment site. 



N-myristoylation sites. 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 



APPJD=1 0063732 
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FIGURE 21 



CTGAGGCGGCGGTAGCATGGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
GGTGAAGCCAAGAACAGCATTACTGATTCCCAAATGGATGATGTTGAAGTTGTTTATACAATTGA 
CATTCAGAAATATATTCCATGCTATCAGCTTTTTAGCTTTTATAATTCTTCAGGCGAAGTAAATG 
AGCAAGCAC T GAAGAAAATAT TAT CAAAT GT CAAAAAGAATG TGGTAGGT TGGTACAAAT TCCGT 
CGTCAT TCAGATCAGATCAT GACGT T T AGAGAGAGGC TGCT T CACAAAAAC T TGCAGGAG CAT T T 
T TCAAACCAAGACCT T GT T TTTC TGCTAT TAACACCAAGTATAATAACAGAAAGCTGCTCTACTC 
ATCGACT GGAACAT TCC TT ATATAAACC T CAAAAAGGACT T T T TCACAGGGTACCT T TAG T GGT T 
GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
T T TTAGCCGAGCAGTACAAACACACAGCTCT AAATTT TT TGAAGAAGAT GGATCCT TAAAGGAGG 
TACATAAGAT AAATGAAATGTATGC TT CATT ACAAGAGGAAT TAAAGAGT ATAT GCAAAAAAGT G 
GAAGAC AG T GAAC AAG C AG T AGAT AAAC T AGT AAAGGAT G T AAAC AGAT T AAAAC GAGAAAT T GA 
GAAAAGGAGAGGAGCACAGATTCAGGCAGCAAGAGAGAAGAACATCCAAAAAGACCCTCAGGAGA 
ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTT 
ATGTCTTTAAAAAATAGACATGTTTCTAAAAGTAGCTGTAACTACAACCACCATCTCGATGTAGT 
AGACAATCTGACCTTAATGGTAGAACACACTGACATTCCTGAAGCTAGTCCAGCTAGTACACCAC 
AAATCATTAAGCATAAAGCCTTAGACTTAGATGAGAGATGGCAATTCAAGAGATCTCGGTTGTTA 
GATACACAAGACAAACGATCTAAAGCAAATACTGGTAGTAGTAACCAAGATAAAGCATCCAAAAT 
GAGCAGCCCAGAAACAGATGAAGAAATTGAAAAGATGAAGGGTTTTGGTGAATATTCACGGTCTC 
CTACAT T T TGA TCC T TT TAAC C T TACAAGGAGAT T T T T TTATT TGGC TGAT GGGTAAAGCCAAAC 
ATTTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCACC 
TGT T TGCAGT AATACACAGATAAC TC T T AGT GCAT TTACT TCACAAAGTAC T T T TT CAAACATCA 
GATGCTTTTATTTCCAAACCTTTTTTTCACCTTTCACTAAGTTGTTGAGGGGAAGGCTTACACAG 
ACACATTCTTTAGAATTGGAAAAGTGAGACCAGGCACAGTGGCTCACACCTGTAATCCCAGCACT 
TAGGGAAGACAAGT CAGGAGGAT TGAT TGAAGC TAGGAGT TAGAGAC CAGC CTGGGCAACGTAT T 
GAGACCAT GT C TAT TAAAAAATAAAAT GGAAAAGCAAGAAT AGCC T TAT T T TCAAAATATGGAAA 
GAAAT T TAT AT GAAAAT T TAT C T GAGT C AT TAAAAT T C T CC T T AAG T GAT ACT T T T T T AGAAGTA 
CAT TATGGCTAGAG T TGCCAGATAAAAT GCTGGATAT CAT GCAATAAAT T T GCAAAACAT CAT C T 
AAAATTTAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 22 



MEGESTSAVLSGFVLGALAFQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVEWYTIDIQKYI 
PCYQLFSFYNSSGEVNEQALKKILSNVKKNWGWYKFRRHSDQIMTFRERLLHKNLQEHFSNQDL 
VFLLLTPS I ITESCSTHRLEHSLYKPQKGLFHRVPLWANLGMSEQLGYKTVSGSCMSTGFSRAV 
QTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVNRLKREIEKRRGA 
Q I QAAREKN I QKDPQEN I FLC QALRT FFPNS E FLHS CVMS LKNRHVSKS S CNYNHHLDWDNLTL 
MVEHTDIPEASPASTPQIIKHKALDLDDRWQFKRSRLLDTQDKRSKANTGSSNQDKASKMSSPET 

DEE IEKMKG FGEYSRS PT F 




Important features: 
Signal peptide: 



amino acids 1-19 



N-glycosylation sites. 

amino acids 75-79, 322-326 




m 



N-myristoylation site . 



Growth factor and cytokines receptors family. 



amino acids 184-154 



amino acids 134-150 



APP ID=1 0063732 
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FIGURE 2ft 



GGCACAGCCGCGCGGCGGAGGGCAGAGTCAGCCGAGCCGAGTCCAGCCGGACGAGCGGACCAGCGCAGGGCAGCCCAA 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GGTGGGAACGCGTCCGCCCCGGCCCGCACCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 
AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 
AAC AAGAACGAT GCC CCCCT CCAC GAGAT CAACG GT GAT CATTT GAAGATCT GT C CCCAGGGTT CTACCT GCTGCTCT 
CAAGAGAT GGAGGAGAAGTACAGC CT GCAAAGTAAAGAT GAT TT CAAAAGTGT G GT CAGC GAACAGT GCAAT CATTT G 
CAAGCTGTCTTTGCTTCACGTTACAAGAAGTTTGATGAATTCTTCAAAGAACTACTTGAAAATGCAGAGAAATCCCTG 
AATGATAT GTTT GT GAAGAC ATAT G GC CAT TT AT ACAT GCAAAATTCT GAGCTATTTAAAGAT CT CTT C GTAGAGTT G 
AAACGTTACTACGT GGT GGGAAAT GTGAAC CT GGAAGAAATGCTAAAT GACT T CT GGGCT CGCCT CCT GGAGCGGAT G 
TTCCGCCTGGT GAACTCCCAGTAC CACTT TACAGATGAGT AT CT GGAAT GTGT GAGCAAGTATAC GGAGCAGCT GAAG 
CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 
TT AG C GGTT GCGGGAGAT GT CGT GAGCAAGGT CT C CGT G GTAAACCC C ACAGC CCAGT GTAC C CATGCCCT GT T GAAG 
AT GAT CTACT GC T CC CACT GCC GGGGT CT C GT GACT GT GAAGC CAT GT T ACAACTACT GCTCAAACAT CAT GAGAGGC 
TGTT T GGCCAACCAAGGGGATCT C GAT TTT GAAT GGAACAATT T CATAGATGCTATGCT GAT GGT GGCAGAGAGG CT A 
GAGGGTCCTTTCAACATTGAATCGGTCATGGATCCCATCGATGTGAAGATTTCTGATGCTATTATGAACATGCAGGAT 
AATAGTGTTCAAGTGTCTCAGAAGGTTTTCCAGGGATGTGGACCCCCCAAGCCCCTCCCAGCTGGACGAATTTCTCGT 
T CCAT CT CT GAAAGT GC CT T CAGTGCT C GCTT CAGACCACAT CAC CCC GAGGAAC GCC CAAC CACAG CAGCT GGCACT 
AGTTTGGACCGACTGGTTACTGATGTCAAGGAGAAACTGAAACAGGCCAAGAAATTCTGGTCCTCCCTTCCGAGCAAC 
GT TT GCAAC GAT GAGAGGAT GGCTGCAGGAAACGG CAAT GAGGAT GACT GTT GGAAT GG GAAAGGCAAAAG CAGGTAC 
CTGTTTGCAGTGACAGGAAATGGATTAGCCAACCAGGGCAACAACCCAGAGGTCCAGGTTGACACCAGCAAACCAGAC 
ATACT GAT CCTT C GT CAAAT CAT GGCT CTT CGAGT GAT GACCAGCAAGAT GAAGAAT GCATACAATGGGAAC GACGT G 
GACTT CTTT GATAT CAGT GAT GAAAGT AGT GGAGAAGGAAGT GGAAGT GG CT GTGAGTAT CAGCAGT GCC CTT CAGAG 
TTTGACTACAATGCCACTGACCATGCTGGGAAGAGTGCCAATGAGAAAGCCGACAGTGCTGGTGTCCGTCCTGGGGCA 
CAGGCCTACCTCCTCACTGTCTTCTGCATCTTGTTCCTGGTTATGCAGAGAGAGTGGAGATAATTCTCAAACTCTGAG 
AAAAAGT GT T CAT CAAAAAGT TAAAAGG CAC CAGT TAT CACT TTT C TACCAT C CT AG T GACTTTGCTT TTT AAAT GAA 
T GGAC AACAAT GTACAGTTTT TACTAT GT GG C CACT GGTTTAAGAAGT GCT GACT T T GTTT T CTCATT CAGTT TT GGG 
AGGAAAAGGGACT GT GCATT GAGTT GGT T CCT GCT C CCCCAAACCAT GTTAAACGT GGCTAACAGT GT AGGTACAGAA 
CTAT AGT T AGT T GTG CAT TT GT GAT T TTAT CACTCTAT TAT TT GT TT GT AT GT TTT TTTCT CATTT C GTTT GT GG GT T 
TTTTTTTCCAACTGTGATCTCGCCTTGTTTCTTACAAGCAAACCAGGGTCCCTTCTTGGCACGTAACATGTACGTATT 
T CT GAAATATTAAATAGCTGT ACAGAAGCAGGT TT TAT TTAT CAT GT TAT CT TATTAAAAGAAAAAGCCCAAAAAGC 



FIGURE 24 

MARFGLPALLCTIAVLSAALLAAELKSKSCSEVRRLYVSKGFNKNDAPLHEINGDHLKIGPQGST 
CCSQEMEEKYSLQSKDDFKSWSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMFVKTYGH 
LYMQNSELFKDLFTOLKRYYWGNTOLEEMLNDFWARLLERMFRLWSQYHFTDEYLECVSKYTE 
QLKPFGDVPRKLKLQVTRAFVAARTFAQGLAVAGDWSKVSWNPTAQCTHALLKMIYCSHCRGL 
VTVKPCYNYCSNIMRGCLANQGDLDFEWNNFIDAMLMVAERLEGPFT^IESVMDPIDVKISDAIMN 
MQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 
EKLKQAKKFWSSLPSNVCNDERMAAGNGNEDDCWNGKGKSRYLFAVTGNGLANQGNNPEVQVDTS 
KPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSGEGSGSGCEYQQCPSEFDYNATDHAG 
KSANEKADSAGVRPGAQAYLLTVFC ILFLVMQREWR 

Important features : 
Signal peptide: 

amino acids 1-22 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 515-524 

N-glycosylation site. 

amino acids 514-518 

Glycosaminoglycan attachment sites. 

amino acids 494-498, 498-502 

N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypicans proteins . 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE 25 

CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCTGAC 
CTGAGTCATCCCCAGGGATCAGGAGCCTCCAGCAGGGAACCTTCCATTATATTCTTCAAGCAACT 
TACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGCTTCTAGGAGATGGCTCCAGGAAGGCGGCCAAGAATGTGAGTGCAAAGATTGGTTCCTGAG 
AGCCCCGAGAAGAAAATTCATGACAGTGTCTGGGCTGCCAAAGAAGCAGTGCCCCTGTGATCATT 
TCAAGGGCAATGTGAAGAAAAGAAGACACCAAAGGCACCACAGAAAGCCAAACAAGCAT T CCAGA 
GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTTGTAGGAGCTCTG 
AGCGCCCACTCTTCCAATTAAACATTCTCAGCCAAGAAGACAGTGAGCACACCTACCAGACACTC 
TTCTTCTCCCACCTCACTCTGCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAAAAGCA 
TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 
AGCTAGT GTCAT T TAACCTT AAATGCAAT CAGGAAAGT AGCAAACAGAAG TCAAT AAAT AT TT T T 
AAATGTCAAAAAAAAAAAAAA?\AAA 
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FIGURE 26 

MKVLISSLLLLLPLMLMSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRRKFM 
TVS GL PKKQCPCDH FKGMVKKTRHQRHHRKPNKH S RACQQ FLKQC QLRS FAL P L 

Important features : 
Signal peptide: 

amino acids 1-22 

N-myristoylation sites. 

amino acids 27-33, 46-52 
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FIGURE 27 



GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCTCAG 
AGCTGGTCTGCCATGGACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 
TGGCCGTGCTGACTCCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAGC TC TTCAGCCAG 
ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 
AGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATATGAGCGGTTTGTGGTGGCTCCT 
GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 

it 

CTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCCGGAGAGTACTGAGACCGGGAGGTGTGC 



ill 
III 



;■ " 



TCTTTTTCT GGGAGCAT G T GGCAGAACCAT AT GGAAGC T GGGC CT TCAT G T GGCAGCAAGT T T TC 



m GAGCCCACCTGGAAACACATTGGGGATGGCTGCTGCCTCACCAGAGAGACCTGGAAGGATCTTGA 
faj GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 
% J GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC 

TTCCCCAGCCTCCAATTAGAACAAGCCACGCACCAGCCTATCTATCTTCCACTGAGAGGGACCTA 
GCAGAATGAGAGAAGACATTCATGTACCACCTACTAGTCCCTCTCTCCCCAACCTCTGCCAGGGC 
m AATCTCTAACTTCAATCCCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGG 

AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 
CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 
CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 
CCTCTCTCCCCACTACCACCTTGTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 
AT GC CAGAGCAAGAC T CAAAGAGGCAGAGG TTTTGTTCT C AAAT AT T T T T T AAT AAAT AGAC GAA 
ACCACG 
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FIGURE 28 



MDILVPLLQLLVLLLTLPLHLMALLGCWQPLCKSYFPYLMAVLTPKSNRKMESKKRELFSQIKGL 
TGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYEREWAPGEDM 
RQIADGSMDVWCTLVLCSVQSPRKVLQEWRVLRPGGVLFFWEHVAEPYGSWAFMWQQVFEPTW 
KHI GDGCCLTRETWKDLENAQFS E I QMERQPPPLKWLPVGPH IMGKAVKQS FPS SKALI CS FPS L 
QLEQATHQPIYLPLRGT 

Important features: 



Signal peptide: 




amino acids 1-23 



Leucine zipper pattern. 



amino acids 10-32 




N-myristoylation sites. 

amino acids 64-70, 78-84, 80-86, 91-97, 201-207 
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FIGURE 2Q 

CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCTATGCTGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGG7^AGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGCTGGGAGATAGGGAACAGAAGAGGGTAGTGGGTGGGCTAGGGGGGCTGCCTTATTTAAA 
GTGGTTGTT TAT GAT T CT TAT ACT AAT T T AT AC AAAGAT AT T AAGGCCC T GT T CAT TAAGAAAT T 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
AGTTAAAAGCTGAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE so 

MLLLTLLLLLLLLKGS CLEWGLVGAQKVS SATDAP I RDWAFFP P S FLCLLPHRPAMTCSQAQPRG 
EGEKVGDG 

Important features: 
Signal peptide: 

amino acids 1-15 

Growth factor and cytokines receptors family: 

amino acids 3-18 
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FIGURE 31 

GTTTGAATTCCTTCAACTATACCCACAGTCCAAAAGCAGACTCACTGTGTCCCAGGCTACCAGTT 
CCTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 
ATTTGCATCTGTTTTGATAAATGATGTTGACACCCTCCACCGAATTCTAAGTGGAATCATGTCGG 
GAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCCATGATGTTTACC 
TTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTATTTTGGGATTGTTGTT 
TGTCTGCGGTGTTTTATGGTGGCTGTATTATGACTATACCAACGACCTCAGCATAGAATTGGACA 
CAGAAAGGGAAAATATGAAGT GCGTGC T GGGGT TT GC TAT CGTAT C CACAGGCATCACGGCAGTG 
CTGCTCGTCTTGATTTTTGTTCTCAGAAAGAGAATAAAA.TTGACAGTTGAGCTTTTCCAAATCAC 
AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 
TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 
AATTGGCCTCATCTGGACTAGTGAATTCATCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAG 
TGGTTACTTGTTATTTCAACAGAAGTAAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC 
TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 
GATTCCGAGAATCATTGTCATGTACATGCAAAACGCACTGAAA.GAACAGCAGCATGGTGCATTGT 
CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 

AACC AGAAT GCATAT AC T ACAAC TGC TAT T AAT GGG AC AGAT T T C T GT ACAT CAGC AAAAG AT GC 
ATTCAAAATCTTGTCCAAGAACTCAAGTCACTTTACATCTATTAACTGCTTTGGAGACTTCATAA 
TTTTTCTAGGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTAC 
AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 
CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGT.TTTGCTGTTGATC 
TGGAAACAAATGATGGATCGTCAGAAAAGCCCTACTTTATGGATCAAGAATTTCTGAGTTTCGTA 
AAAAGGAGCAACAAAT TAAACAATGCAAGGGCACAGCAGGACAAGCACTCAT TAAGGAATGAGGA 
GGGAACAGAAC T CCAGGCCAT T GTGAGATAGATACC CATT TAGGTATC TGTACC TGGAAAACAT T 
T C C T T C T AAGAG C CAT T T ACAG AAT AG AAG AT GAGAC C AC T AGAGAAAAG T TAG T GAAT t T T T T T 
T TAAAAGACCTAATAAACC CTAT T C T TC C T CAAAA 



FIGURE 32 



MSGRDTILGLCILALALSLAMMFTFRFITTLLVHIFISLVILGLLFVCGVLWWLYYDYTNDLSIE 
LDT ERENMKCVLGFAI VS T G I TAVLLVL I FVLRKR I KLT VE LFQ I TNKAI S SAPFLL FQPLWT FA 
ILIFFWVLWVAVLLSLGTAGAAQVMEGGQVEYKPLSGIRYMWSYHLIGLIWTSEFILACQQMTIA 
GAWTCYFNRSKNDPPDHPI LS SLS ILFFYHQGTWKGS FLI S WRI PRI IVMYMQNALKEQQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD 
FI I FLGKVLWCFT VFGGLMAFNYNRAFQWAVPLLLVAFFAYLVAHS FLSVFETVLDALFLC FA 
VDLETNDGSSEKPYFMDQEFLSFVKRSNKLNNARAQQDKHSLRNEEGTELQAIVR 

Important features : 
Signal peptide: 

amino acids 1-20 



amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 



Putative transmembrane domains: 



N-glycosylation sites. 




N-myristoylation sites. 



amino acids 204-208, 295-299, 313-317 



amino acids 147-153, 178-184, 196-202, 296-275, 342-348 
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FIGURE 33 



GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTTAGA 
ATAATT T GTATGGGATT T G TGATGCAGGAAAGCCTAAGGGAAAAAGAATAT TCAT TC TGT G T GG T 
GAAAATTTTTTGAAAAAAAAATTGCCTTCTTCAAACAAGGGTGTCATTCTGATATTTATGAGGAC 
TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 
ATTCAAACAAAGAAACGGCAAAGAAGATTAAAAGGCCCAAGTTCACTGTGCCTCAGATCAACTGC 
GAT G TCAAAGCCGGAAAGATCATCGAT CC TGAGT TCAT TGTGAAAT GT CCAGCAGGAT GCCAAGA 
CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 
TACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTTGTTCGGAAGGTTGCTGGACAGTCT 
GGT TACAAAGGGAG T TAT TC CAACGGTGT CCAATCGTT AT CCC TACCAC GATGGAGAGAAT C CT T 
TATCGTCTTAGAAAGTAAACCCAAAAAGGGTGTAACCTACCCATCAGCTCTTACATACTCATCAT 
CGAAAAGTCCAGCT GCCCAAGCAGGTGAGACCACAAAAGC CTATCAGAGGCCAC C TAT TCCAGGG 
ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 
Uf CACCTTGCCAAGGCCATCCCCTTCTGCTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGG 
p GCCACAGGAGCCAGGAGATGGATCTCTGGTCCACTGCCACCTACACAAGCAGCCAAAACAGGCCC 
S AGAGCTGATCCAGGTATCCAAAGGCAAGATCCTTCAGGAGCTGCCTTCCAGAAACCTGTTGGAGC 

GGATGTCAGCCTGGGACTTGTTCCAAAAGAAGAATTGAGCACACAGTCTTTGGAGCCAGTATCCC 
p TGGGAGATCCAAACTGCAAAATTGACTTGTCGTTTTTAATTGATGGGAGCACCAGCATTGGCAAA 
!jt| CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 
%l CGGTCCACTGATGGGTGTTGTCCAGTATGGAGACAACCCTGCTACTCACTTTAACCTCAAGACAC 

ACACGAAT TC TCGAGAT CT GAAGACAGCCATAGAGAAAAT T AC TCAGAGAGGAGGAC TT T C TAAT 
GTAGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCGG 
GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 
P GACTTGCGAGAGAGTCAGGAATCAACATTTTCTTCATCACCATTGAAGGTGCTGCTGAAAATGAG 

AAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACGGCTTCTACTC 
GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 
ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 
GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 
PI AGAGTTTGAGATTTCCGACACGGACACGCGCATCGGGGCCGTGCAGTACACCTACGAACAGCGGC 
fj TGGAGTTTGGGTTCGACAAGTACAGCAGCAAGCCTGACATCCTCAACGCCATCAAGAGGGTGGGC 

TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 
GTCCAAGCCCAACAAGAGGAAGTTAATGATCCTCATCACCGACGGGAGGTCCTACGACGACGTCC 
GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 
GC CCAAGAGGAGC TAGAAG TCATTGCCAC TCACCC CGC CAGAGAC CACT CCT T C TT TGT GGACGA 
GT T T GACAACC TCCAT CAGTATGTC CC CAGGATCAT CCAGAACAT T T GTACAGAGT TCAACTCAC 
AGCC T CGGAAC TGAAT T CAGAGCAGGCAGAGCACCAGCAAG T GCT GCTT TAC TAAC T GACGTGT T 
GGACCACCCCACCGCTTAATGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAAC 
AAATGTCTTGTTATTATTCTTTGCCATCATGCTTTTTCATATTCCAAAACTTGGAGTTACAAAGA 
TGATCACAAACGTATAGAAT GAGCCAAAAGGC TACATCATGT TGAGGGTGCTGGAGATT TTACAT 
T T T GACAAT T GT TT TCAAAATAAATGT T CGGAATACAG T GCAGCCC T TACGACAGGC TTACGTAG 
AGCTTTTGTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCAT 
T TT TGTCAT GACAATGTAGGAATTGCTGAAT TAAATGTT TAGAAGGATGAAAAATAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^^ 
AAAAA?\AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 



w 
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FIGURE 34 



MRTWLTMKAS VIEMFLVLLVTGVHSNKETAKK I KRPKFT VPQ INCDVKAGKI I DPE FI VKCPAG 
CQDPKYHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSLSLPRWR 
ESFIVLESKPKKGVTYPSALTYSSSKSPAAQAGETTKAYQRPPIPGTTAQPVTLMQLLAVTVAVA 
TPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQRQDPSGAAFQKP 
VGADVS LGLVPKEE L S T QS LE PVS LGDPNCKI DLSFLIDGSTSI GKRR FR I QKQLLADVAQAL D I 
GPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN 
RSGAPNWWMVDGWPTDKVEEASRLARE SGINI FFIT IEGAAENEKQYWEPNFANKAVCRTNG 
FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN 
LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL 
FKKSKPNKRKLMIL I TDGRSYDDVRI PAMAAHLKGVIT YAI GVAWAAQEELEVIATHPARDHS FF 



CI VDEFDNLHQYVPRIIQNICTEFNSQPRN 

Ml 



Important features: 
Signal peptide: 



tU amino acids 1-26 



Transmembrane domain: 

amino acids 181-200 

N-glycosylation sites. 

amino acids 390-394 , 520-524 

N-myristoylation sites. 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Amidation site. 

amino acids 304-308 
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FIGURE 35 



CCGAGCACAGGAGATTGCCTGCGTTTAGGAGGTGGCTGCGTTGTGGGAAAAGCTATCAAGGAAGAAATTGC 
CAAACCATGTCTTTTTTTCTGTTTTCAGAGTAGTTCACAACAGATCTGAGTGTTTTAATTAAGCATGGAAT 
AC AG AAAAC AAC AAAAAACT T AAGCT T T AATT T C AT CT GG AATT CC AC AGTT T T CT T AGCTCCC TGGACCC 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGG ATGG CCTCGGCTCTCTGGACTGTCCTTCCGAGTA 

GG ATGT CACT GAGAT CCC TC AAAT GGAGCC TCCT GC TGCT GT C ACTCC TGAGT TT C TT T GT GAT GTGGT AC 
CT C AGCCT T CCC C ACT AC AATGT G AT AGAACGCGT G AACT GG AT GT AC TT CT AT GAGT AT GAGC CGAT TT A 
CAGACAAGACTTTCACTTCACACTTCGAGAGCATTCAAACTGCTCTCATCAAAATCCATTTCTGGTCATTC 
TGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTACTTGGGGTGAAAAAAAGTCT 
t K TGGTGGGGATATGAGGTTCTTACATTTTTCTTATTAGGCCAAGAGGCTGAAAAGGAAGACAAAATGTTGGC 
m . AT T GTCCT TAGAGG ATGAAC ACCT TCT TTATGGTGACAT AAT CCGAC AAGAT TT TT T AGAC AC AT AT AAT A 

O ACCTGACCTTGAAAACCATTATGGCATTCAGGTGGGTAACTGAGTTTTGCCCCAATGCCAAGTACGTAATG 
||! AAGACAGAC ACT GATGTT TT C AT C AAT AC T GGCAATTT AGT G AAGT AT CT TT T AAACC T AAACC ACTC AG A 

S l| GAAGTTTTTCACAGGTTATCCTCTAATTGATAATTATTCCTATAGAGGATTTTACCAAAAAACCCATATTT 

CTTACCAGGAGTATCCTTTCAAGGTGTTCCCTCCATACTGCAGTGGGTTGGGTTATATAATGTCCAGAGAT 

TT GGT GCC AAGG AT CTAT GAAAT GAT GGGT CACGT AAAACCC AT C AAG TT T GAAGAT G T TT ATGTCGGGAT 

ill' 

CTGTTTGAATTTATTAAAAGTGAACATTCATATTCCAGAAGACACAAATCTTTTCTTTCTATATAGAATCC 
*>y AT T T GG AT GTCT GT CAACTGAGAC GT GT GATT GC AGCCC AT GGCTTT T CT T CCAAGGAGATC AT CACT T T T 

TA TGGCAGGTCATGCTAAGGAACACCACATGCCATTATTAACTTCACATTCTACAAAAAGCCTAGAAGGACAG 

m — 

m GATACCTTGTGGAAAGTGTTAAATAAAGTAGGTACTGTGGAAAATTCATGGGGAGGTCAGTGTGCTGGCTT 
||| AC AC TG AAC T GAAACT CATG AAAAACCC AGACT GGAGACTGG AGGGTT AC ACT T GT GAT T TATT AGTC AG G 

Q C C CT TC AAAG AT GAT ATGT G GAGGAAT T AAAT AT AAAGG AAT TG GAGGTT TTT GCTAAAG AAAT T AAT AG G 

IV ACCAAACAATTTGGACATGTCATTCTGTAGACTAGAATTTCTTAAAAGGGTGTTACTGAGTTATAAGCTCA 

CTAGGCTGTAAAAACAAAACAATGTAGAGTTTTATTTATTGAACAATGTAGTCACTTGAAGGTTTTGTGTA 
TAT C TT AT GT GG ATT ACC AAT TT AAAAAT AT AT GT AGT TCTGTGT C AAAAAACT TC TT CACT GAAG TT AT A 
CT GAACAAAATT TT AC CT GT T T T T GGT C AT TT AT AAAGT ACT TC AAGATGTT GC AG TAT T TC AC AGTT ATT 
ATT ATT T AAAAT TACT TC AACT TT GT GT T T TT AAAT GT T T T G ACGAT T TCAAT ACAAGAT AAAAAGGAT AG 
TGAATC ATT C T T T AC ATGC AAAC ATT T T CC AGT T AC TT AACT GAT C AG TT TATT AT T GAT AC AT CACT CCA 
TT AATGTAAAGT CAT AGGTC AT T ATT GC AT AT CAGT AAT CT C TT GGAC TT T GT T AAAT AT TT TACT GT GGT 
AAT AT AGAGAAG AAT TAAAGC AAG AAAAT C TGAAAA 



Si 
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FIGURE 36 



MASALWTVLPSRMSLRSLKWSLLLL 

TLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEKEDKMLA 
L S LEDEHLLYGD 1 1 RQD FLDT YNNL TLKT IMA.FRWVTE FCPNAKYVMKTDTDVFINTGNLVKYLL 
NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV 
KPIKFEDVYVGICLNLLKVNIHIPEDTNLFFLYRIHLDVCQLRRVIAAHGFSSKEIITFWQVMLR 
NTTCHY 

Important features: 

Type II transmembrane domain: 




N-glycosylation sites. 

amino acids 72-7 6, 154-158, 198-202, 212-216, 326-330 



Glycosaminoglycan attachment site. 



amino acids 20-39 




amino acids 239-243 



Ly-6 / u-PAR domain proteins. 



amino acids 23-3.7 



N-myristoylation site. 



amino acids 271-277 
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FIGURE 37 



C GC TCGGGC ACCAGCC GC GGCAAGG ATGG AGC T GGGT TGC T GGACGC AGTT GGGGC TCACT T T TC TTCAGC T CCT TC TC ATC 
TCGTCCT TGCCAAGAGAGT ACACAGTCATTAATGAAGCCT GCCC TGGAGCAGAGT GGAA.TATC ATGTGTCGGGAGT GC TGT G 
AAT ATGATC AGAT TGAGTGC GTC TGCCCCGGAAAGAGGGAAGTCGTGGGTTATACC ATCCC T TGC T GC AGGAATGAGGAGAA 
TGAG TGTGACTC C TGC CTGAT CCACCC AGGT T GTACCATC T T TGAAAAC TGCAAGAGC TGC CGAAAT GGCT CATGGGGGGGT 
ACCTTGGATGACTTCTATGTGAAGGGGTTCTACTGTGCAGAGTGCCGAGCAGGCTGGTACGGAGGAGACTGCATGCGATGTG 
GCC AGGT TC TGCGAGC CCCAAAGGGTC AGATT T TGTT GGAAAGC T ATCCCC TAAATGC TCAC TGTGAAT GGAC CAT T CATGC 
TAAACCTGGGTTTGTCATCCAACTAAGATTTGTCATGTTGAGTCTGGAGTTTGACTACATGTGCCAGTATGACTATGTTGAG 
GT TCGTGATGGAGAC AACCGCGATGGCCAGAT CAT C AAGC GTGT C TGTGGC AAC GAGCGGC C AGC TC C T ATC CAGAGCAT AG 
GATC C TCAC TC C ACGT CCTCT TC CAC TCCGAT GGC TCCAAGAAT TTTGACGGT TT C CATGC CAT T TATGAGGAGATCAC AGC 
ATGCTCCTCATCCCCTTGTTTCCATGACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAAGTGTGCCTGCTTGGCAGGC 
TATAC TGGGCAGCGC T G TGAAAATC TCC TTGAAGAAAGAAAC TGCTC AGAC CC TGGGGGC C CAGTCAATGGGTAC CAGAAAA 
r' [s TAACAGGGGGCCCTGGGCTTATCAACGGACGCCATGCTAAAATTGGCACCGTGGTGTCTTTCTTTTGTAACAACTCCTATGT 
fc»? TCT T AGTGGCAATGAGAAAAGAAC T TGC CAGCAGAAT GGAGAGT GGTCAGGGAAAC AGC CC ATCTGC ATAAAAGC C T GCCGA 

CI GAACC AAAGAT T TCAGACC TGGTGAGAAGGAGAGT TC TTC C GAT GCAGGTTCAGTC AAGGGAGAC ACC ATTACAC C AGC TAT 

y 1 AC TCAGC GGCC T TCAGCAAGC AGAAAC T GCAGAGTGC CCC T ACCAAGAAGCC AGC CC TTCCC T TT GGAGATC TGCCC ATGGG 



ATACC AACATC T GCATACCCAGC TCCAGTAT GAGTGC ATC TCACCCTTCTACCGCCGCC TGGGCAGCAGCAGGAGGACATGT 
C TGAGGACTGGGAAGT GGAGT GGGCGGGCAC CATCC TGC ATCCC TATC TGC GGGAAAAT TGAGAACATCACTGC T CCAAAGA 
CCC AAGGGT TGC GCTGGCCGTGGC AGGCAGC CATC TACAGGAGGACCAGCGGGGTGCATGACGGCAGCC TAC ACAAGGGAGC 
GTGGTTCCTAGTCTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACCTGGGGAAG 
GTCAC CATGAT C AAGAC AGCAGACC TGAAAGT T GT T TTGGGGAAATTC TACCG GGATGATGACCGGGAT GAGAAGACCAT C C 



CC T AGAC AAGGC CCGT ATCAGCACCCGAGTC C AGC CCATC TGCCTCGCTGC CAGT C GGGATCTCAGC AC T TCC TTCC AGGAG 
TCCC ACATC AC T GTGGC TGGC TGGAATGTCC TGGCAGACGT GAGGAGC C CT GGCTT CAAGAACGACAC ACTGCGC T C TGGGG 



f'i AGAGC CT AC AGATTTC T GC TATC AT TC TGCAT C CC AAC TAT GAC CCCAT CC TGCT T GATGC TGAC AT C GCCATCC TGAAGC T 

II! 
CI 

;ral TGG TCAGTGTGGTGGAC TC GC TGC TGTGTGAGGAGCAGC ATGAGGAC C ATGGCAT C CCAGT GAGTGT C AC TGATAAC AT GT T 

Al% CTGTGCC AGCT GGGAACCC AC TGCCCCT TCTGATATC TGC ACTGC AGAGACAGGAGGCATCGCGGCTGTGTC C TTC CCGGGA 

2 '■£ 

mi CGAGCATC TCC T GAGCCAC GC TGGCATC TGATGGGAC TGGTC AGC TGGAGC TATGATAAAACATGCAGCCACAGGC TCTC CA 



CTGCC TTCACC AAGGTGCT GC CT T T TAAAGAC TGGAT TGAAAGAAAT ATGAAATGAACC ATGC TC ATGC ACT C CT T GAGAAG 
TGTTTCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGGCT 
GTGC CAGGGCT TCTGAC T TC AGGGACAAAAC TCAGTGAAGGGTGAGTAGAC C T CCATTGC TGGTAGGC TGAT GC CGC GTCCA 
CTAC TAGGACAGCCAAT TGGAAGATGCCAGGGC TT GCAAGAAGTAAGTT TC T T CAAAGAAGACCATATACAAAACC TCTCC A 
CTCCACTGACCTGGTGGTCTTCCCCAACTTTCAGTTATACGAATGCCATCAGCTTGACCAGGGAAGATCTGGGCTTCATGAG 
GCCCCTTTTGAGGCTCTCAAGTTCTAGAGAGCTGCCTGTGGGACAGCCCAGGGCAGCAGAGCTGGGATGTGGTGCATGCCTT 
TGTGTACATGGCCACAGTACAGTCTGGTCCTTTTCCTTCCCCATCTCTTGTACACATTTTAATAAAATAAGGGTTGGCTTCT 
GAAC TAC AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



APP ID=1 0063732 



Page 191 of 322 



5 i 



FIGURE 38 



MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKRSWGYT 
IPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGDCMRCGQ 
VLRAPKGQI LLES YPLNAHCEWT IHAKPGFVI QLRFVMLSLEFDYMCQYDYVEVRDGDNRDGQ 1 1 
KRVCGNERPAP IQS IGSSLHVL FHS DGSKNFDGFHAI YEE I TAGS S S PCFHDGTCVLDKAGSYKC 
ACLAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTWSFFCNNSYVLSGNE 
KRTCQQNGE WS GKQP I C IKACREPK I S DLVRRRVLPMQVQSRE TPLHQLYS AAFSKQKLQS APTK 
KPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGKWSGRAPSCIPICGKIENITAP 
KTQGLRWPWQAAIYRRTSGVHDGSLHKGAWFLVCSGALVNERTVWAAHCVTDLGKVTMIKTADL 
1:4 KWLGKFYRDDDRDEKTIQSLQISAIILHPNYDPILLDADIAILKLLDKARISTRVQPICLAASR 
CJ DLSTSFQESHITVAGWNVLADVRSPGFKNDTLRSGWSWDSLLCEEQHEDHGIPVSVTDNMFCA 

SWEPTAPSDICTAETGGIAAVSFPGRASPEPRWHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWI 
ERNMK 



jjjj Important features of the protein: 



Signal peptide: 

amino acids 1-23 

EGF-like domain cysteine pattern signature. 



amino acids 260-272 



N-glycosylation sites. 



£!l amino acids 96-100, 279-283, 316-320, 451-455, 614-618 



N-myristoylation sites. 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 

474-480, 491-497, 638-644, 666-672 

Ami da ti on site. 

amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids 150-167 
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G GTT C CT ACAT C CT CTCAT CT GAGAAT CAGAGAGCATAAT CTT CT TAC GGGCCCGT GATTTAT TAAC GT GGCTTAATC 
TGAAGGTTCTCAGTCAAATTCTTTGTGATCTACTGATTGTGGGGGCATGGCAAGGTTTGCTTAAAGGAGCTTGGCTGG 
TTTGGGCCCTTGTAGCTGACAGAAGGTGGCCAGGGAGAATGCAGCACACTGCTCGGAGAATGAAGGCGCTTCTGTTGC 
TGGTCTTGCCTTGGCTCAGTCCTGCTAACTACATTGACAATGTGGGCAACCTGCACTTCCTGTATTCAGAACTCTGTA 
AAGGT GCCT C CCACTACGG CCT GACCAAAGATAGGAAGAG GCGC T CACAAGATGGCT GT CCAGAC GGCT GTGCGAGC C 
TCACAGCCACGGCTCCCTCCCCAGAGGTTTCTGCAGCTGCCACCATCTCCTTAATGACAGACGAGCCTGGCCTAGACA 
AC CCT GCCTACGTGT CCT C GGCAGAGGACG GGCAG CCAG CAAT CAGC CC AGT GGACT CT GGCC G GAGCAACC GAACTA 
GGGCACGGCC CT TT GAGAGAT C CACTATT AGAAGCAGAT CAT TTAAAAAAAT AAAT CGAGCTTT GAGT GT TCTT CGAA 
GGACAAAGAGCGGGAGT GCAGTTGCCAAC CAT GC C GACCAGGGCAGGGAAAATT C T GAAAACACCACT GCCC CT GAAG 
TCTT T CCAAGGT T GTACCAC CT GATTC CAGAT GGT GAAAT T ACCAGCAT CAAGAT CAAT C GAGT AGAT CCCAGT GAAA 
GCCT CT CT ATTAGG CTGGT GGGAGGTAGC GAAAC C CCACT GGTCCATAT CATT AT CCAAC ACATTTAT CGT GAT GGGG 
T GAT C GC CAGAGAC GGC CGGCT ACT GCCAGGAGACAT CAT T CTAAAGGT C AAC GGGAT GGACAT CAGCAATGTC CCT C 



ACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTGATGCGTGAACAGAAGTTCCGCA 



GCAGGAACAATGGACAGGC CCCGGATGCCTACAGACCC C GAGAT GACAG CTTT CAT G T GATT CT CAACAAAAGT AGC C 
CC GAGGAGCAGCTT GGAATAAAACT GGT GCGCAAGGT G GAT GAGCCTGGGGT TT T CAT CTTCAAT GT GCT GGAT GGC G 
GT GTGGCAT ATC GACAT GGT CAGCTT GAGGAGAAT GACC GT GT GT TAGC CAT CAAT GGACAT GAT CT TCGAT AT GGCA 
GCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCGTCGTGTCCCGCCAGGTTCGGCAGC 
GGAGCCCTGACAT CTTT CAG GAAGCCGG C T GGAACAG CAAT GGCAGCT GGTC C CC AGGGC CAGGGGAGAGGAGCAACA 
CT CCCAAGCC CCT CCAT CCTACAAT TACT TGT CAT GAGAAGGT GGTAAAT AT CCAAAAAGACCC C GGT GAAT CT CTC G 
GCAT GACCGT CG CAGGGGGAGCAT CAC AT AGAGAATGGGATT T GC CTAT CTAT GT CAT CAGT GT T GAGC C CGGAGGAG 
TCAT AAGCAGAGAT GGAAGAATAAAAACAGGT GACAT T TT GT TGAAT GT GGAT GGGGT CGAACT GACAGAGGT CAGCC 
GGAGT GAGGCAGT GGCAT TATT GAAAAGAACAT CAT C CT CGATAGTACT CAAAGCTTT GGAAGT CAAAGAGTAT GAGC 
CCCAGGAAGACTGCAGCAGCCCAGCAGCCCTGGACTCCAACCACAACATGGCCCCACCCAGTGACTGGTCCCCATCCT 
GGGT CAT GT GGCT GGAATT ACCACGGT GCT TGTAT AACT GTAAAGAT AT T GTATTACGAAGAAACAC AGCTGGAAGT C 
TGGG CTT CT GCAT T GTAGGAGGT TATGAAGAATACAAT GGAAACAAACCTTT TTT CAT CAAAT C CATT GTTGAAGGAA 
CACCAGCATACAAT GAT GGAAGAATTAGAT GT GGT GAT AT TCTT CTTGCT GT CAATGGTAGAAGTACAT CAGGAATGA 
TACATGCTTGCTTGGCAAGACTGCTGAAAGAACTTAAAGGAAGAATTACTCTAACTATTGTTTCTTGGCCTGGCACTT 
TT TT ATAGA ATCAAT GAT G GGT CAGAGGAAAACAGAAAAATCACAAATAG GCTAAGAAGTTGAAACACT ATAT TTAT C 
TT GT CAGTT TTTATATTTAAAGAAAGAAT ACAT T GTAAAAAT GT CAGGAAAAGT ATGAT CAT CT AAT GAAAGC CAGTT 
ACAC C TCAGAAAATATGATT CCAAAAAAATTAAAACTACTAGTTT TTT TT CAGT GT GGAGGAT T T CT CATTACT CT AC 
AACAT TGTT T ATAT TTTT T CTAT T CAATAAAAAGCCCTAAAACAACT AAAAT GATT GAT TT GTATAC CC CACTGAATT 
CAAGCTGAT TTAAATTT AAAATT T GGTATATGCT GAAGT CTGCCAAG GGT ACATTAT G GC CAT T TTTAATTT AC AGCT 
AAAATAT TT TTTAAAAT GCATT GCT GAGAAACGTT GCTTT CAT CAAACAAGAATAAAT ATTTT T CAGAAGTT AAA 
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FIGURE AO 

MKALLLLVLPWLSPiWriDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVSAAAT I S LMTDEPGLDNPAYVS S AEDGQPAI S PVDSGRSNRTRARPFERS T IRSRS FKKINR 
ALSVLRRTKSGSAVANHADQGRENSENTTAPEVFPRLYHLIPDGEITSIKINRVDPSESLSIRLV 
GGSETPLVHI IIQHIYRDGVIARDGRLLPGDI ILKVNGMDI SNVPHNYAVRLLRQPCQVLWLTVM 
REQKFRSRNNGQAPDAYRPRDDSFHVILNKSSPEEQLGIKLVRKVDEPGVFIFNVLDGGVAYRHG 
QLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLWSRQVRQRSPDIFQEAGWNSNGSWSPG 
PGERSNTPKPLHPTITCHEKWNIQKDPGESLGMTVAGGASHREWDLPIYVISVEPGGVISRDGR 
I KT GDI LLNVDGVEL TE VSRS EAVAL LKRT S SSI VLKALEVKE YE PQEDCS S PAALDSNHNMAP P 
SDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIVGGYEEYNGNKPFFIKSIVEGTPAYNDG 
RIRCGDILLAVNGRSTSGMIHACLARLLKELKGRITLTIVSWPGTFL 

Important features: 
Signal peptide: 

amino acids 1-15 

N-glycosylation sites. 

amino acids 108-112, 157-161, 289-293, 384-388 

Tyrosine kinase phosphorylation sites. 

amino acids 433-441, 492-500 

N-myristoylation sites. 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE 41 



ACCAGGCATTGTATCTTCAGTTGTCATCAAGTTCGCAATCAGATTGGAAAAGCTCAACTTGAAGCTTT 
CTTGCCTGCAGTGAAGCAGAGAGATAGATATTATTCACGTAATAAAAAACATGGGCTTCAACCTGACT 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
CACCAGTAACTACTT C GTGGGTGCCAT TCAAGAGAT TCCTAAAGCAAAGGAGTTCAT GGCTAAT T TC C 
ATAAGACCCTCAT TTT GGGGAAGGGAAAAACT CTGACTAATGAAGCAT CCACGAAGAAGGTAGAACTT 
GACAACTGTCCTTCTGTGTCTCCTTACCTCAGAGGCCAGAGCAAGCTCATTTTCAAACCAGATCTCAC 
T TT GGAAGAGGTACAGGCAGAAAAT CC CAAAGT GT CCAGAGGCCGGTAT CGCCCT CAGGAAT GT AAAG 
CTTTACAGAGGGTCGCCATCCTCGTTCCCCACCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAA 
CATCTGCATCCCTTCCTGCAGAGGCAGCAGCTGGATTATGGCATCTACGTCATCCACCAGGCTGAAGG 
H TAAAAAGT TTAATCGAGCCAAAGTCT T GAATGT GGGCTAT CTAGAAGCC CT CAAGGAAGAAAAT TGGG 

ACTGCTTTATATTCCACGATGTGGACCTGGTACCCGAGAATGACTTTAACCTTTACAAGTGTGAGGAG 
CATCCCAAGCATCTGGTGGTTGGCAGGAACAGCACTGGGTACAGGTTACGTTACAGTGGATATTTTGG 
g .. s G GGT GT T AC T GC C CT AAGC AGAGAGCAG T T T T T C AAGGT GAAT GGAT T C TC TAACAAC TAC T GG GGAT 

SJ GGGGAGGCGAAGACGATGACCTCAGACTCAGGGTTGAGCTCCAAAGAATGAAAATTTCCCGGCCCCTG 
till C CT GAAGT GGG T AAATATAC AAT GGT C T TCC AC AC TAGAGACAAAGG CAAT GAG G T GAAC G CAGAAC G 

GAT GAAGCT CT T ACACCAAGTGT CACGAGTCT GGAGAACAGAT GGGT TGAGTAGTTGT T CT TATAAAT 
TAGTATCTGTGGAACACAATCCTTTATATATCAACATCACAGTGGATTTCTGGTTTGGTGCATGACCC 
TGGATCTTTTGGTGATGTTTGGAAGAACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAA 
ATAGTAGCACACATTAAGAACCT GT TACAGCT CAT TGTTGAGC T GAAT T TT T CCTTT TT GT ATTT TCT 



f§8 



m TAGCAGAGC TC CTGGT GAT GTAGAGTATAAAACAGTTGTAACAAGACAGCT T T CT TAGT CATTTT GAT 

fl CAT GAGG GT TAAATAT T GT AAT AT GGAT AC T T GAAGGAC T T T AT ATAAAAG GAT GAC T CAAAGGAT AA 

Rj AATGAACGCTATTTGAGGACTCTGGTTGAAGGAGATTTATTTAAATTTGAAGTAATATATTATGGGAT 

AAAAGGCCACAGGAAATAAGACTGCTGAATGTCTGAGAGAACCAGAGTTGTTCTCGTCCAAGGTAGAA 
AGGTACGAAGATACAATACT GTTAT TCATTTATCCTGTACAAT CATCT GTGAAGTGGTGGT GTCAGGT 
GAGAAGGCGTCCACAAAAGAGGGGAGAAAAGGCGACGAATCAGGACACAGTGAACTTGGGAATGAAGA 
GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCTGAGCTGGTTGCAGGTGCTGATAGC 
CTTCAGGGGAGGACCTGCCCAGGTATGCCTTCCAGTGATGCCCACCAGAGAATACATTCTCTATTAGT 
TTT T AAAG AGT T T T T G T AAAAT GAT T T T G T AC AAG T AGG AT AT GAAT TAG C AG TT TAC AAG T T T ACAT 
ATTAACTAATAATAAATAT GTCTAT CAAATACCTCT GTAGTAAAATGT GAAA&AGCAAAA 
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FIGURE 42 



MG FNLT FHLS YKFRLLLLLT LCLT WGWAT SNYFVGAI QE I PKAKE FMANFHKTL I LGKGKTLTN 
EAS TKKVELDNC PS VS P YLRGQS KL I FKPDLT LEE VQAENPKVS RGRYRPQE CKALQRVAI LVPH 
RNREKHLMYLLEHLHPFLQRQQLDYGIYVIHQAEGKKFNRAKLLNVGYLEALKEENWDCFIFHDV 
DLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGED 
DDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSRVWRTDGLSSCSYKLV 
SVEHNPLYINITVDFWFGA 

Important features: 
Signal peptide: 

amino acids 1-27 



m 



N-glycosylation sites . 

amino acids 4-8, 220-224, 335-339 




Xylose isomerase proteins. 




amino acids 191-202 
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FIGURE 43 



m 

w 



hi 



GCTCAAGACCCAGCAGT GGGACAGCCAGACAGACGGCACGATGGCAC T GAGC TC CCAGAT C TGGG 

CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 

ACGGGACAACTTGCAGAGCTGCAACCCCAGGACAGAGCTGGAGCCAGGGCCAGCTGGATGCCCAT 

GTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 

ATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTGCCCCCGTCCCCTCCC 

TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 

TCCAAAAAAAAAAAAAAZ\AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 4d 



MALSSQIWAACLLLLLLLASLTSGSVFPQQTGQLAELQPQDRAGARASWMPMFQRRRRRDTHFPI 
CIFCCGCCHRSKCGMCCKT 

Important features: 
Signal peptide: 

amino acids 1-24 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 58-59 




N-myristoylation site. 



Prokaryotic membrane lipoprotein lipid attachment site. 



amino acids 44-50 



amino acids 1-12 



II! 

f ! 
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FIGURE 45 

GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAAT ATG GCTGGTTCCCCAACATGCCTCACCC 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GACC TT CAACACAACCCCTCT TGTCACCATACAGCCAGAAGGGGGCACTATCATAGTGACCCAAA 
ATCGTAATAGGGAGAGAGTAGACTTCCCAGATGGAGGCTACTCCCTGAAGCTCAGCAAACTGAAG 
AAGAAT GACT CAGGGATCTACTATGTGGGGATATACA.GCTCATCAC TCCAGCAGCCCTCCACCCA 
GGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCA 
AT AAGAAT GGCACCT GT GTGAC CAATCT GACATGC TGCAT GGAACAT GGGGAAGAGGATGT GAT T 
TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
CTGGAGATGGGGAGAAAGTGATATGACCTTCATCTGCGTTGCCAGGAACCCTGTCAGCAGAAACT 
TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 
GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 
GT T T CT GAAGAGAGAGAGACAAGAAGAGT ACAT TGAAGAGAAGAAGAGAGT G GACAT TTG T CGGG 
AAACTCCTAACATATGCCCCCATTCTGGAGAGAACACAGAGTACGACACAATCCCTCACACTAAT 
AGAACAATCCTAAAGGAAGATCCAGCAAATACGGT T TACTCCACTGTGGAAATACCGAAAAAGAT 
GGAAAATCCCCACTCACTGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAGAATGTTA 
T C TAGACAGCAGTGCAC T CCCC TAAGT CT CT GC TCA 
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FIGURE J.6 

MAGSPTCLTLIYILWQLTGSAASGPVKELVGSVGGAVTFPLKSKVKQVDSIVWTFNTTPLVTIQP 
EGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHVYEHLSK 
PKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKALGQAANESHNGSILPISWRWGESDMTFIC 
VARNPVSRNFSSPILA&KLCEG 

EKKRVDICRETPNICPHSGENTEYDTIPHTNRTILKEDPANTVYSTVEIPKKMENPHSLLTMPDT 
PRLFAYENVI 

Important features: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 224-250 

Leucine zipper pattern. 

amino acids 229-251 

N-glycosylation sites. 

amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 
291-295 
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FIGURE 47 



GGCTCGAGCGTTTCTGAGCCAGGGGTGACC ATGA CCTGCTGCGAAGGATGGACATCCTGCAATGG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT 
ATAGGAGCAGGTC TGAT GGCCAT TC CAGCAACAACAAT GT CCT TGACAGCAAGAAAAAGAGCGTG 
C T GC AACAACAGAAC T GGAAT GTTTCTTT CATC AT T T T T C AGTGTGAT C AC AGT CAT T GGT GCTC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 
AACAGTAATGCCAAT TGTGAATTTTCAT T GAAAAACATCAGTGACAT TCATCCAGAATCCTTCAA 
CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 
■ . ACACCAT GGCGAGTGGC T GGAGAGCATC TAGT T TC CACT T CGAT TC T GAAGAAAACAAACATAGG 

m t CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 
|,;| CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 
9| T GTAGT T T AAT GGGAAT AAAAT GTAAGT AT CAG TAG T T T GAAAAAAAAAAA 



If! 



o 
III 
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FIGURE 48 



MTCCEGWTSCNGFSLLVLLLLGWLNAIPLIVSLVEEDQFSQNPISCFEWWFPGIIGAGLMAIPA 
TTMS LT ARKRACCNNRT GMFLS S FFS VI T VI GAL YCML I S I QALLKG PLMCNS P SNSNANCE FS L 
KNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTMASGWRASSFHFDSEENKHRLIHFSVFLGL 
LL VG I LE VL FGLSQIVIGFLGCLC G VS KRRS Q I V 

Important features: 
Transmembrane domains: 

amino acids 10-31 (type II), 50-72, 87-110, 191-213 
N-glycosylation sites. 

amino acids 80-84, 132-136, 148-152, 163-167 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 223-227 

N-myristoylation sites. 

amino acids 22-28, 54-60, 83-89, 97-103, 216-222 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 207-218 

TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 
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FIGURE 4Q 
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III 
il 



Ril 



ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAGTGAGGAACCTGCAGAGTCACACAGT TGC T GACCAATTGAGCTGTGAGCC TGGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCT.CTCGCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 
CATT TGCCAATAAAGAC GATC CC T TC T AC TATGAC TGGAAAAACC TGGAGCT GAGCGGAC T GAT C 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
CAGCCAG AAGCAGCACAGT CC TGTACC TG AGAAGGCCAT CCC AC T CAT C AC T C C AGGCT CT GCC A 
CTACTTGCTGAGCACAGGACTGGCCTCCAGGGATGGCCTGAAGCCTAACACTGGCCCCCAGCACC 
TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 
TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 
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FIGURE so 



$ 

CI 



MERVTLALLLIAGLTALEANDPFiWKDDPFYYDWKNLQLSGLICGGLLAIAGIAAVLSGKCKYKS 
SQKQHSPVPEKAIPLITPGSATTC 

Important features: 
Signal peptide: 

amino acids 1-16 

Transmembrane domain: 

amino acids 36-59 

N-myristoylation sites. 



Ft 

IP amino acids 41-47, 45-51, 84-90 

hi 
SJ 

i: s 



Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 . 

amino acids 54-67 
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FIGURE zi 



GTGGACTCTGAGAAGCCCAGGCAGTTGAGGACAGGAGAGAGAAGGCTGCAGACCCAGAGGGAGGG 
AGGACAGGGAGTCGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGCGGCAAGG 
AGGAGACCCTGGTGGGAGGAAGACACTCTGGAGAGAGAGGGGGCTGGGCAGAGATGAAGTTCCAG 
GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 
CGGAGAGGAAAGCACTGGGACAAATATTGGGGAGGCCCTTGGACATGGCCTGGGAGACGCCCTGA 
GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 
GCCCTTGGCCAAGGGACCAGAGAAGCAGTTGGCACTGGAGTCAGGCAGGTTCCAGGCTTTGGCGC 
AGCAGATGCTTTGGGCAACAGGGTCGGGGAAGCAGCCCATGCTCTGGGAAACACTGGGCACGAGA 
TTGGCAGACAGGCAGAAGATGTCATTCGACACGGAGCAGATGCTGTCCGCGGCTCCTGGCAGGGG 
GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 
Q CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 

GAAACTCAGCAGGCAGCTTTGGAATGAATCCTCAGGGAGCTCCCTGGGGTCAAGGAGGCAATGGA 
GGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAG 
AGCCAGCAACCAGAAT GAAGGG TGCACGAAT C CCCCACCATC TGGC TC AGGTGGAGGCT C CAGCA 
ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC 
'* AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG 
Q TGGCGGCAGCAGTGGTGGCAGCAGTGGCAACAGTGGTGGCAGCAGAGGTGACAGCGGCAGTGAGT 

s m 

S: CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 

S CATAAACCCGGGTGTGAAAAGCCAGGG7VATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 

p C T T CAGAG GAC AGG GAG T T T C C AGCAACAT GAGG GAAAT AAG C AAAGAG G GC AATC GCCTCCTTG 

H| GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGGTGGGGCAGTGGAGGAGGTGACGCTGTT 

GGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA 
GAATTTTAAATCCAAGCTGGGTTTCATCAACTGGGATGCCATAAACAAGGACCAGAGAAGCTCTC 
GCATCCCGTGACCTCCAGACAAGGAGCCACCAGATTGGATGGGAGCCCCCACACTCCCTCCTTAA 
AACACCACCCTCTCATCACTAATCTCAGCCCTTGCCCTTGAAATAAACCTTAGCTGCCCCACAAA 

^^^^ 



m 
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FIGURE S2 



MKFQGPLACLLLALCLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAIGKEAGGAAGSKVS 
E AL G QGT RE AVGT GVRQVP G F GAADAL GNRVGE AAHALGNTGHE I GRQAE DVI RHGADAVRGS W QGVP 
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGS 
SSGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSWGSSTGSSSGNHGGSGGGNGHKPGCEKPGNE 
ARGSGESGIQGFRGQGVSSNMREISKEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGWTWSETSPGM 
FNFDTFWKNFKSKLGFINWDAINKDQRSSRIP 

Signal peptide: 



^ amino acids 1-21 

ci 



N-glycosylation site. 



u \ amino acids 265-269 



Glycosaminoglycan attachment site. 



Til amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392 

jfWft 
W 



Casein kinase II phosphorylation site. 

amino acids 26-30, 109-113, 259-263, 300-304, 304-308 

!'■■! N-myristoylation site. 

R| -amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331,. 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 
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FIGURE S3 



GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATGTCGCTGCTGAGCCTGCCCTGG 
CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 
ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCGGCCGGCTCCAGTGTTTCC 
CACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 
TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCGAT 
CATCCCCTTGATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 
TTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGGCTGGGAGAAGGGATACTGCTG 
AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAAGATCCT 
GAAG TC CTATATAACGAT C T TCAACAAGAGT GCAAACATCAT GC T TGACAAGTGGCAGCACC TGG 
CC TCAGAGGGCAGCAG TCGT C T GGACAT GT T TGAGCACAT CAGCC TCAT GACCTTGGACAGT CTA 
CAGAAATGCATC TTCAGCTTTGACAGCCAT TGTGAGGAGAGGCCCAGT GAATATAT TGCCACCAT 
CTTGGAGCTCAGTGCCCTTGTAGAGAAAAGAAGCCAGCATATCCTCCAGCACATGGACTTTCTGT 
ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 
GCTGTCATCCGGGAGCGGCGTCGCAGCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA 
AGCCAAGTCCAAGACTTTGGATTTCATTGATGTGCTTCTGCTGAGCAAGGATGAAGATGGGAAGG 
CATTGTCAGATGAGGATATAAGAGCAGAGGCTGACACCTTCATGTTTGGAGGCCATGACACCACG 
GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 
¥ I ACAGGAGGTGCAAGAGCT'TCTGAAGGACCGCGATCCTAAAGAGATTGAATGGGACGACCTGGCCC 

AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC 
; |4 TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 

CCTCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGACC 
CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 
GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 
GCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAATTGATCATGC 
GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 
CCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTCAAA 



III 

fl! 
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MSLLSLPWLGLRPVMSPWLLLLLWGSWLLARIIAWTYAFYNNCRRLQCFPQPPKRNWFWGHLG 
LITPTEEGLKDSTQMSATYSQGFTVWLGPIIPFIVLCHPDTIRSITNAS7\AIAPKDNLFIRFLKP 
WLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDKWQHLASEGSSRLDMFEHI 
SLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYYLSHDGRRFHRAC 
RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 
MFGGHDTTASGLSWLYNLARHPEYQERCRQEVQELLKDRDPKEIEWDDLAQLPFLTMCVKESLR 
LHPPAPFISRCCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTVWPDPEVYDPFRFDPENSKGRSP 
lAFIPFSAGPRNCIGQAFAMMMKWiALMLLHFRFLPDHTEPRRKLELIMRAEGGLWLRVEPLN 
VGLQ 




Important features: 



Transmembrane domains: 



• w 
si 



amino acids 13-32 {type II), 77-102 




Cytochrome P450 cysteine heme -iron ligand signature. 

amino acids 461-471 




N-glycosylation sites . 

amino acids 112-116, 168-172 
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FIGURE ss 



E : 

Aft 

S»i? 

U 

Si 

Is! 



.l?-lif 

ir !*, ; 
5 ■*»?? 

a. 
p 

Ire 



AT C GCAT C AAT T GGGAGT AC CAT C T T C C T C ATGG GAC C AG T G AAACAG C T GAAGC GAAT G T T T GA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAATTCATGGCCAGT TT TATGAAGCT TTGGAAGGCACT AT GGACAGAAGCTGGTGGACAGT TT T 
GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
GTGATTCGAACATTTGAGGGTTACTTTTGGAAGCAACAATACATTCTCGAACCTGAATGTCAGTA 
GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
T C T G GAT G T T G T C C C AC T GAAT T C C CAT G AAT AC AAAC C TAT T C AGC AAC AG C AAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



APPJD 
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FIGURE 56 



MGPVKQLKRMFEPTRLIATIMVLLCFALTLCSAFWWHNKGLALIFCILQSLALTWYSLSFIPFAR 
DAVKKC FAVCLA 

Important features: 
Signal peptide: 

amino acids 1-33 

Type II fibronectin collagen-binding domain protein. 

amino acids 30-72 
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FIGURE 57 



CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCCAGCAGGCCGCCAAC ATG CTCTGTCTGTGCCTG 
TACGTGCCGGTCATCGGGGAAGCCCAGACCGAGTTCCAGTACTTTGAGTCGAAGGGGCTCCCTGCCGAGCTGAAGTCC 
AT TTTCAAGCTCAGT GT CT T CAT CCCCT CCC AGGAAT T CT CCACCTAC CGCCAGT GGAAGCAGAAAATT GTACAAGCT 
GGAGATAAGGAC CTT GAT GGGC AGCTAGACTTT GAAGAATT T GTCC ATTAT CT CC AAGAT CAT GAGAAGAAGCTGAGG 
CT GGT GT T T AAGATTT T GGACAAAAAGAAT GAT GGAC GCATT GACGCGCAGGAGAT CAT GCAGTC CCT GCG GGAC TTG 
GGAGTCAAGATATCTGAACAGCAGGCAGAAAAAATTCTCAAGAGCATGGATAAAAACGGCACGATGACCATCGACTGG 
AAC GAGT GGAGAGACTAC CACCTCCT C CACC C C GT G GAAAACATC CCC GAGAT CAT CCTCTACTGGAAGC ATT CCAC G 
AT CTTTGATGTGGGTGAGAATCTAACGGTCCC GGAT GAGTTCACAGTGGAGGAGAGGCAGACGGGGATGTGGTGGAGA 
CACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGGACAGGCTCAAGGTGCTCATG 
CAGGTCCATGCCTCCCGCAGCAACAACATGGGCATCGTTGGTGGCTTCACTCAGATGATTCGAGAAGGAGGGGCCAGG 
TCACT CT GGC GGGG CAAT GGCAT CAAC GTCCT CAAAAT TGC CC CC GAATC AGC CAT CAAAT T CAT GG C CT AT GAGCAG 
ATCAAGCGCCTTGTTGGTAGTGACCAGGAGACTCTGAGGATTCACGAGAGGCTTGTGGCAGGGTCCTTGGCAGGGGCC 
AT C G C CCAGAGCAGCAT CT AC C CAATGGAGGT C CT GAAGAC C CGGAT GGCGCT GC GGAAGAC AGGCCAGT ACT CAGGA 
, AT G C T GGACTGC GCC AG GAGGAT CCT GGCCAGAGAGG GGGT GGCC GCCTT CTACAAAGGC TAT GT CC CCAACAT GCT G 

H GGCAT CAT CC CCT AT GC C GGCAT CGAC CTT GCAGT CT AC GAGACGCT CAAGAAT GC CT GGCT GCAGCACTAT GCAGT G 

p\ AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 
55 CCCCT GGC CCTAGTCAGGACCC GGAT GCAGGCGCAAG C CT CTATT GAGGGCGCTC C GGAGGT GAC CAT GAGCAGCCT C 

W TTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCA 
|f| GCTGTGAGCATCAGCTACGTGGTCTACGAGAACCTGAAGATCACCCTGGGCGTGCAGTCGCGG TGA CGGGGGGAGGGC 
f| CGCCCGGCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATGTGCCAACACT 
*W AAGCTGTCTCGAGCCAAGCTGTGAAAACCCTAGACGCACCCGCAGGGAGGGTGGGGAGAGCTGGCAGGCCCAGGGCTT 
hj GTCCTGCTGACCCCAGCAGACCCTCCTGTTGGTTCCAGCGAAGACCACAGGCATTCCTTAGGGTCCAGGGTCAGCAGG 
U n '{ CTCCGGGCTCACATGTGTAAGGACAGGACATTTTCTGCAGTGCCTGCCAATAGTGAGCTTGGAGCCTGGAGGCCGGCT 
W TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 
|!| CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 

ATAATCCATGATGAAAGGTGAGGTCACGTGGCCTCCCAGGCCTGACTTCCCAACCTACAGCATTGACGCCAACTTGGC 
* TGTGAAGGAAGAGGAAAGGATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATG 
Fl CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 
ggig- GGC CT GGAC C CT GT CAGGAT GGG CCCCACCT CAGAACCAAACT CACT GTC C CC ACT GT GGCAT GAGGGCAGTGGAGCA 

CCATGTTTGAGGGCGAAGGGCAGAGCGTTTGTGTGTTCTGGGGAGGGAAGGAAAAGGTGTTGGAGGCCTTAATTATGG 
ACT GTTGGGAAAAGGGT T TT GT CCAGAAGGACAAGCC GGACAAAT GAGCGACT TCT GT GC TT C CAGAGGAAGAC GAG G 
GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC 
GGGACCAGCCCCACATTCCACTTGTGTCACTGCTTGGAACCTATTTATTTTGTATTTATTTGAACAGAGTTATGTCCT 
Q AACTATTTTTATAGATTTGTTTAATTAATAGCTTGTCATTTTCAAGTTCATTTTTTATTCATATTTATGTTCATGGTT 
2 GAT T GTAC CT T C C CAAGCCC GC CCAGT G GGAT GGGAGGAGGAGGAGAAGGGGG GC CT T GG GC CGCT G CAGT CACAT CT 

5 ^ GT C CAGAGAAATT CCTTT T G GGACT GGAGGCAGAAAAGCG GCCAGAAGGCAGCAGCC C T G GCT CCTTT CCTTT GGCAG 

GTTGGGGAAGGGCTTGCCCCCAGCCTTAGGATTTCAGGGTTTGACTGGGGGCGTGGAGAGAGAGGGAGGAACCTCAAT 
AACCTTGAAGGTGGAATCCAGTTATTTCCTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCCATTCCACCAGAATGACCTGA 
T GAGGAAAT CT T C AAT AGGAT GCAAAGAT CAAT GCAAAAATT GT TATATAT GAACATATAAC T GGAGT C GT CAAAAAG 
CAAATTAAGAAAGAATT GGACGT T AGAAGT TGT CATTTAAAGCAGCCTT CTAATAAAGTT GTTTCAAAGCTGAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE *8 



MLCLCLYVPVIGEAQTEFQYFESKGLPAELKSIFKLSVFIPSQEFSTYRQWKQKIVQAGDKDLDG 
QLDFEEFVHYLQDHEKKLRLVFKILDKKNDGRIDAQEIMQSLRDLGVKISEQQAEKILKSMDKNG 
TMT I DWNE WRD YHLLHPVENI PE 1 1 LYWKH.S T I FDVGENLT VPDE FT VEERQTGMWWRHLVAGGG 
AGAVSRTCTAPLDRLKVLMQVHASRSNNMGIVGGFTQMIREGGARSLWRGNGINVLKIAPESAIK 
FMAYEQIKRLVGSDQETLRIHERLVAGSLAGAIAQSSIYPMEVLKTRMALRKTGQYSGMLDCARR 
ILAREGVAAFYKGYVPNMLGIIPYAGIDLAVYETLKNAWLQHYAVNSADPGVFVLLACGTMSSTC 
GQLASYPLALVRTRMQAQASIEGAPEVTMSSLFKHILRTEGAFGLYRGLAPNFMKVIPAVSISYV 




VYENLKI TLGVQSR 



Important features : 



Signal peptide: 



amino acids 1-16 



Putative transmembrane domains: 




Mitochondrial energy transfer proteins signature. 

amino acids 206-215, 300-309 



N-glycosylation sites. 

amino acids 129-133, 169-173 



amino acids 284-304, 339-360, 376-394 



Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 
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FIGURE so 



GGAAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCCATGGC 
TTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGGATCATCATTATTCTGGCTGGAGCAATTG 
CACTCATCATTGGCTTTGGTATTTCAGGGAGACACTCCATCACAGTCACTACTGTCGCCTCAGCT 
GGGAACAT TGGGGAGGAT GGAATC C T GAGCT GCACT T T T GAACCTGACAT CAAAC TT T CTGATAT 
CGTGAT ACAAT GGCT GAAGGAAGGTGTTT TAGGC TT GGT C CAT GAGT T CAAAGAAGGCAAAGATG 
AGC TGT CGGAGCAGGATGAAATGT TCAGAGGCCGGACAGCAGT GT T T GC T GATCAAGT GATAGT T 
GGCAATGCC TCTTTGCGGCT GAAAAACGT GCAAC TCACAGATGCTGGCACCT ACAAATGT TATAT 
j*» CATCACTTCTAAAGGCAAGGGGAATGCTAACCTTGAGTATAAAACTGGAGCCTTCAGCATGCCGG 
fcl AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 
2?- CAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCAGGGAGCCAACTTCTCGGAAGTCTCCAATAC 

m 

U\ CAGCTTTGAGCTGAACTCTGAGAATGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGA 
%l TC AACAACACATACT CC T G TAT GAT T GAAAATGACAT T GCCAAAGCAACAGGGGATATCAAAGT G 

ACAGAAT CGGAGATCAAAAGGCGGAGT CACCTACAGCT GCTAAAC TCAAAGGC TTCTCTGTGTGT 
C TC T TCT T T CT TTGCCAT CAGC TGGGCAC TT C T GCC T C T CAGC CC T TACCT GATGC TAAAATAAT 
1, GTGCCTTGGC CAC AAAAAAG CAT G C AAAG T CAT TGT T ACAAC AG GGAT C TACAGAAC TAT T T C AC 

u 

g si ' CACCAGATATGACCTAGTTTTATAT T TCT GGGAGGAAATGAAT TCATAT CTAGAAGTCTGGAGTG 



i us 



AGCAAAC AAGAGC AAGAAAC AAAAAGAAG C C AAAAGCAGAAG GC TC C AATAT GAAC AAGAT AAAT 
III C TATCTT CAAAGACATAT TAGAAGT TGGGAAAATAAT T CAT GT GAAC TAGACAAGTGT GT T AAGA 

m 

M GTGATAAGTAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 

CACCTGGGGAGTGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGAATTTTTAGTTATATGTGC 
TGTAATGT TGC TC TGAGGAAGCCC C T G GAAAGT C TAT CC CAACATAT CCACATC T TATATT CCAC 
AAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
TCT T CCCAACTGACAAAT GCCAAAGT TGAGAAAAAT GAT CATAAT T T T AGCATAAACAGAGCAGT 
CGGGGACACCGATTTTATAAATAAACTGAGCACCTTCTTTTTAAACAAAAAAAAAAAAAAA?^AAA 
AAAAAAAAAAAAAAAAAAAAAAA?\AAAAAAAAA 
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FIGURE 60 



MASLGQILFWSIISIIIILAGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDIKLS 
DIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTDAGTYKC 
YIITSKGKGNANLEYKTGAFSMPEVNVDYNASSETLRCEAPRWFPQPTWWASQVDQGANFSEVS 
NTSFELNSENVTMKWSVLYNVTINNTYSCMIENDIAKATGDIKVTESEIKRRSHLQLLNSKASL 
CVS S FFAI SWALLPLS P YLMLK 

Important features : 



N-glycosylation sites . 

amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 



few if 



Signal peptide: 



amino acids 1-28 




Transmembrane domain: 



amino acids 258-281 



ill 



220-224 




N-myristoylation sites. 



amino acids 52-58, 12 6-132, 188-194 
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FIGURE 6l 



TGACGTCAGAATCACCATGGCCAGC TATCCT TACCGGCAGGGCTGCCCAGGAGC TGCAGGACAAG 
CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 
TGGAGGGCCCTATGGACACCCCAATCCTGGGATGTTCCCCTCTGGAACTCCAGGAGGACCATATG 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 
GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 
GGTGGACTCAGATCACAGTGGCTATATCTCCATGAAGGAGCTAAAGCAGGCCCTGGTCAACTGCA 
ATTGGTCTTCATTCAATGATGAGACCTGCCTCATGATGATAAACATGTTTGACAAGACCAAGTCA 
GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 
CCAGCAGTATGAGCGGGACCGCTCGGGCTCCATTAGGTACACAGAGCTGCAGCAAGCTCTGTCCC 
fl AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 
%i GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 

GGCCTTCCGGGAGAAGGACACAGCTGTACAAGGCAACATCCGGCTCAGCTTCGAGGACTTCGTCA 
CCATGACAGCTTCTCGGATGCT ATGA CCCAACCATCTGTGGAGAGTGGAGTGCACCAGGGACCTT 
TCCTGGCTTCTTAGAGTGAGAGAAGTATGTGGACATCTCTTCTTTTCCTGTCCCTCTAGAAGAAC 
ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA 



■SSIS? 



¥1* 



fl AATAGT GAGGACCGGGGC T GAGGCCACACAGATAGGGGCC TGATGGAGGAGAGGATAGAAGT T GA 



ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 
AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 
GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 
CTGACCAAGCATGAGAGAGATC TGTC TAT GGGACCAGTGGCT T GGAT TCTGCC ACAC CCATAAAT 
CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 
ATTTGGGGCCAAAAGTCCAGTGAAATTGTAAGCTTCAATAAAAGGATGAAACTCTGA 
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FIGURE 62 

MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 
SGYISMKELKQALWCNWSSFNDETCLMMINMFDKTKSGRIDVYGFSALWKFIQQWKNLFQQYDR 
DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 
DTAVQGNIRLS FEDFVTMTASRML 

Important features of the protein : 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 147-150 

Casein kinase II phosphorylation sites. 

amino acids 135-138, 150-153, 202-205, 271-274 

N-myristoylation sites. 

amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 
66, 70-75, 78-83, 83-88, 87-92, 110-115 
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FIGURE 6 s 



CAGGATGCAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 
TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 
AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 
TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 
GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 
GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 
AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 
Li TGCCTGGGGAGTCTTCTCCCGATGCGACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 

TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 



j{ CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 

TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 



3 .1 i 

%| AT C T GGG G TAT CAAT AAT C AAC CC C C AGG T AC CAGC T GG GGAAAT AT T AAT CGG TAT C CAGGAGG 



CAGCTGGGGAAATATTAATCGGTATCCAGGAGGCAGCTGGGGGAATATTAATCGGTATCCAGGAG 



W 

Ji* GCAGCTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 



CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 
G GGC TAGAG CAC GAT AGAGGGAAAC C C AAC AT T GGGAGT T AGAG TC C TGCTCCCGCCCCTTGCTG 



V\ TGTGGGCTCAATCCAGGCCGTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 
IB TGCTCAT CTCC?U\TAAAATAAAAGCACT TATGAAAAAA?^AAAAAAAAAAAA7\AAA 

AAAAAAAAAAAAAAAAAA&AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 64 



MQGRVAGSCAPLGLLLVCLHLPGLFARSIGWEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 
DPRSNDLARVPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPj^DSWPPEDPWQMMAAAAEDRLGEA 
LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Casein kinase II phosphorylation sites. 

amino acids 56-59, 155-158 

N-myristoylation sites* 

amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 
259-264, 269-274, 270-275, 280-285, 281-286, 305-310 
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FIGURE 6s 
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AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACTATGGGGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCTCAACACCATGTTGTCTGCAACACATGACAGCCATTGAAGCCTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
C TGAGT G GCAATAAAT AAAAT T C GG T AT G C T G 
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FIGURE 66 

MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL 
HHARS QHHWCNT 



Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 37-41 

N-myristoylation sites. 

amino acids 15-21, 19-25, 60-66 
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FIGURE 67 



ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
GCCTGCCCATCTAGGTCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
GGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAGTGCTTAATAGCAGGGAAGAAGGTACTTCAA 
AGACTCTGCCCCTGAGGTCAAGAGAGGATGGGGCTATTCAC TT TTATATAT TTATATAAAAT TAG 
TAGTGAGATGTAAAAAAAAAAAAAAAAAA 
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FIGURE 68 

MANPGLGLLIALGLPFLLARWGRAWGQIQTTSANENSTVLPSSTSSSSDGNLRPEAITAIIWFS 
LLAALLLAVGLALLVRKLREKRQTEGTYRPSSEEQFSHAAEARAPQDSKETVQGCLPI 



Important features: 
Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

amino acids 56-80 

N-glycosylation site. 

amino acids 36-40 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 86-90 

Tyrosine kinase phosphorylation site. 

amino acids 86-94 

N-myristoylation sites. 

amino acids 7-13, 26-32 
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FIGURE 6q 



GC CAGGAAT AACTAGAGAGGAACA ATG G GGTTATT C AGAGGTT TT GTT TTCCT CTTAGTT CT GTGCCT GCT GCAC CAG 
TCAAATACTTCCTTCATTAAGCTGAATAATAATGGGTTTGAAGATATTGTCATTGTTATAGATCCTAGTGTGCCAGAA 
GAT GAAAAAATAATT GAACAAAT AGAGGAT AT GGT GACT ACAGCT T CT ACGTACCT GTTT GAAGCCACAGAAAAAAGA 
TTTTTTT T CAAAAAT GTAT CTAT ATTAATT C CT GAGAATT G GAAGGAAAAT C CTCAGTACAAAAGGC CAAAACAT GJkA 
AACCAT AAACAT GCT GAT GT TATAGTT GCAC CACCTACACT C C CAG GT AGAGATGAACCATACAC CAAGCAGTT CACA 
GAATGT GGAGAGAAAG GCGAATACATT CACT T CACC C CT GAC CTT CTACTT GGAAAAAAACAAAAT GAATAT GGACCA 
C CAGGCAAACT GT TT GT CCAT GAGTGG GCT CACCT CC GGT G GGGAGT GTT T GATGAGTACAAT GAAGAT CAGC CT TT C 
TACCGT GCTAAGT CAAAAAAAAT CGAAGCAAC AAGGT GTT C C GCAGGTAT C T CTG GTAGAAATAGAGT TT ATAAGT GT 
CAAGGAGGCAGC T GT CTTAGTAGAGCAT GCAGAATT GATT CTACAACAAAACT GTAT GGAAAAGATT GT CAATTCT TT 
CCT GATAAAGTACAAACAGAAAAAGC AT CCATAAT GT T TAT GCAAAGTAT T GATT CT GTT GTT GAATT TT GT AAC GAA 
AAAACCCATAATCAAGAAGCT CCAAG CCT ACAAAACAT AAAGT GCAATTT TAGAAGT ACAT G GGAG GT GAT T AGC AAT 
T CT GAG GATTT TAAAAACACCATACC CATGGT GACACCAC CT C CT CCACCT GT CT T CT CATTGCT GAAGATCAGT CAA 
AGAAT T GT GT GCTTAGTT CTT GAT AAGT CT GGAAGCAT GGGGG GT AAGGACCGCCT AAAT CGAAT GAATCAAG CAGCA 
jnlf AAACATT T CCT GCTGCAGACT GT TGAAAAT GGAT CCT GGGT GG GGAT GGT T C ACT TT GATAGT ACT GCCACT ATT GT A 

f*% AATAAGCTAAT CCAAATAAAAAGCAGT GAT GAAAGAAACACACTCAT GGCAGGAT TACCT AC ATAT CCTCT GGGAGGA 

. ACT TCCAT CT GCT CT GGAATTAAATAT GCATTT CAGGT GATT GGAGAGCTACATT C CC AACTC GATGGAT C CGAAGTA 
!Q CT GCT GCT GACT GAT GG GGAGGATAACACT GCAAGT T CTT GT ATT GAT GAAGT GAAACAAAGT GGGGC CATT GTT CAT 

f || TTTATTGCTTTGGGAAGAGCTGCTGATGAAGCAGTAATAGAGATGAGCAAGATAACAGGAGGAAGTCATTTTTATGTT 
"fl TCAGATGAAGCTCAGAACAATGGCCTCATTGATGCTTTTGGGGCTCTTACATCAGGAAATACTGATCTCTCCCAGAAG 
|&J TCCCTTCAGCTCGAAAGTAAGGGATTAACACTGAATAGTAATGCCTGGATGAACGACACTGTCATAATTGATAGTACA 
^ J GT GGGAAAGGACACGTT CTTT CT CAT CACAT GGAACAGTCT GCCT C CCAGTATTT C TCTCT GGGAT CCCAGTGGAACA 

s ^ ATAATGGAAAATTTCACAGTGGATGCAACTTCCAAAATGGCCTATCTCAGTATTCCAGGAACTGCAAAGGTGGGCACT 

T GGGCAT ACAAT CTT CAAG C CAAAGC GAAC CCAGAAACATTAACTATT ACAGTAACTT CT CGAGCAGCAAAT T CTT CT 
ffi GT GCCT CCAAT C ACAGT GAAT GCTAAAAT GAAT AAGGACGTAAACAGT TT CCCCAGCC CAATGAT T GTTT ACGCAGAA 

K ATTCTACAAGGATATGTACCTGTTCTTGGAGCCAATGTGACTGCTTTCATTGAATCACAGAATGGACATACAGAAGTT 
» T T G GAACT TTT GGATAAT GGT GCAGG CGCT GAT T CTTT CAAGAAT GAT G GAGT CTACT CCAGGTAT TTTACAGCAT AT 

jp| AC AGAAAATG GC AGATAT AGCTTAAAAGTT C GG GCT CAT G GAGGAGCAAACACT GC CAGGCTAAAATT ACGGC CT CCA 

j** CTGAATAGAG CCGC GTACAT ACCAGGCT G GGTAGT GAAC G GGGAAATT GAAGCAAACC C GC CAAGACCT GAAAT T GAT 

ill GAGGATACTCAGACCACCTTGGAGGATTTCAGCCGAACAGCATCCGGAGGTGCATTTGTGGTATCACAAGTCCCAAGC 

CTT CCCTT GC CTGACCAATACCCACCAAGT CAAAT CACAGACCTT GAT GCCACAGTT CAT GAGGATAAGATTAT T CTT 
ACATGGACAGCACCAGGAGATAATTTTGAT GTT GGAAAAGT T CAACGTTATAT CAT AAGAAT AAGT G CAAGTAT T CT T 
GAT CT AAGAGAC AGT TTT GAT GAT GC TCT T CAAGTAAATACTAC T GAT C T GTCAC CAAAGGAGG CCAACT CCAAGGAA 
AGCTTT GCAT T T AAACCAGAAAATAT CT CAGAAGAAAAT GCAAC C CACATATT TATT GCC ATTAAAAGTATAGATAAA 
AGCAAT TT GACAT CAAAAGTATC CAACAT T GCACAAGT AACTTT GT TTAT CCCT CAAGCAAAT CCT GAT GACATT GAT 
CCTACACCTACTC CTACT CCTACT CCTACT CCT GAT AAAAGT CATAAT T CTGGAGTTAATATT T C TAC GCT GGTATTG 
T CT GT GATT GGGT CT GTT GT AATT GT T AACT TT ATT TTAAGTACCACCATT TGAA C CT TAACGAAGAAAAAAAT CTTC 
AAGT AGAC CTAGAAGAGAGT TTTAAAAAACAAAACAAT GT AAGTAAAGGAT AT T T CTGAAT CT TAAAAT T CAT CC CAT 
GT GT GATCATAAACT CATAAAAATAATTT TAAGAT GT CGGAAAAGGATACT TT GATTAAATAAAAACACT CAT GGATA 
T GT AAAAACT GTCAAGATT AAAATT TAATAGTT TCATT TATTT GTTAT TT TAT T T GTAAGAAATAGT GAT GAACAAAG 
ATCCT T TT T CATACT GATAC CTGGT T GTATATT AT TT GAT GCAACAGT TT T CT GAAAT GATAT T T CAAATT GCAT CAA 
GAAATTAAAAT CAT CTAT CT GAGTAGT CAAAAT ACAAGTAAAGGAGAGC AAATAAACAACATT T GGAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 70 



r 3: = 



MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKNVSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTECGEKGEY 
IHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATRCSAGISGRN 
RVYKCQGGSCLSRACRIDSTTKLYGKDCQFFPDKVQTEKASIMFMQSIDSWEFCNEKTHNQEAP 
SLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLVLDKSGSMGGKDRLNR 
MNQAAKHFLLQTVENGSWGMVHFDSTATIVNKLIQIKSSDERNTLMAGLPTYPLGGTSICSGIK 
YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGG 
SHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLTLNSNAWMNDTVIIDSTVGKDTFFL 
I TWNSLPPS I SLWDPSGT IMENFTVDATSKMAYLS I PGTAKVGTWAYNLQAKANPETLT I TVTSR 
C! AANSSVPPITWAKl^KDVNSFPSPMIWAEILQGYVPVLGANVTAFIESQNGHTEVLELLDNGA 

m 

f I GADSFKNDGWSRYFTAYTENGRYSLKVRAHGGANTARLKLRPPLNRAAYIPGWVVNGEIEANPP 

RPEIDEDTQTTLEDFSRTASGGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN 
yi FDVGKVQRYI IRISAS ILDLRDS FDDALQVNTTDLS PKEANSKES FAFKPENI SEENATHI FIAI 

HI KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 
K VNFILSTTI 

I'll 

■ Signal peptide: 

amino acids 1-21 

Putative transmembrane domains: 

amino acids 284-300, 617-633 

Leucine zipper pattern. 

amino acids 469-491, 476-498 

N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 
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FIGURE 71 



CT CCTTAGGT GGAAACC CTGGGAGTAGAGTACT GAC AGCAAAGAC CGGGAAAGACCATAC GTCCC CGGG CAGGG GT GA 
CAACAGGTGTCATCTTTTTGATCTCGTGTGTGGCTGCCTTCCTATTTCAAGGAAAGACGCCAAGGTAATTTTGACCCA 
GAGGAGCAAT GAT GTAGCCACCT CCTAACCT TC CCTTC TT GAACC C CCAGTT AT GCCAGGATTTACTAGAGAGT GT CA 
ACTCAACCAGCAAGCGGCTCCTTCGGCTTAACTTGTGGTTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCA 
GTGCTCAGAAGTGACTTGCCTGAGGGTGGACCAGAAGAAAGGAAAGGTCCCCTCTTGCTGTTGGCTGCACATCAGGAA 
GGCTGTGATGGGAATGAAGGTGAAAACTTGGAGATTTCACTTCAGTCATTGCTTCTGCCTGCAAGATCATCCTTTAAA 
AGTAGAGAAGCT GCT C T GTGT GGT GGTTAACT CC AAGAGGCAGAACT CGTT CTAGAAGGAAAT GGATG CAAGCAGCT C 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTG AATGA TGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTTTTGCTGGTGCTCCTCTGCTGTGCTATCTCTGTCCTGTACATGTTGGCCTGCACCCCAAAAGGTGACGAGGAGCAG 
CTGGCACTGCCCAGGGCCAACAGCCCCACGGGGAAGGAGGGGTACCAGGCCGTCCTTCAGGAGTGGGAGGAGCAGCAC 
CGCAACTACGT GAGCAGC CT GAAGC GGCAGATCGCACAGCT CAAGGAGGAGCT GCAGGAGAGGAGT GAGCAGCT C AGG 
AAT GGG CAGTACCAAGCCAGC GATGCT GCT GGCCT GGGT CT GGACAGGAGCCC C CCAGAGAAAACCCAGGCCGACCTC 
CT GGCCT T CCT GCAC T CGCAGGT GGACAAGGCAGAGGT GAATGCT GGCGT CAAGCT GG C CACAGAGTAT GCAGCAGTG 
CCT TT C GATAGCTT TACT CTACAGAAGGT GTACCAGCT G GAGACT GGCC T T ACC C GCCAC CCCGAGGAGAAGC CT GTG 
AGGAAGGACAAGC GG GAT GAGTT G GT GGAAG CCAT T GAAT CAG C CTTG GAGACC CT GAACAAT CCT GCAGAGAACAGC 
&af CC CAAT CACC GTCCT TACACGGC CT CT GATTT CATAGAAGGGAT CTAC CGAACAGAAAGGGACAAAGGGACAT T GT AT 

y 1 GAGCT CACCTT CAAAGGGGAC CACAAACACGAATT CAAACGGCT CAT C T T ATT T C GAC CAT TCAGC CCCAT CAT GAAA 

GTGAAAAATGAAAAGCTCAACATGGCCAACACGCTTATCAATGTTATCGTGCCTCTAGCAAAAAGGGTGGACAAGTTC 
CGGCAGTT CAT GCAGAAT TT CAG GGAGAT GT GCAT T GAGCAGGAT GGGAGAGT CCATCT CACT GTT GT TTACTTT GGG 
\j AAAGAAGAAATAAAT GAAGT CAAAGGAATACTT GAAAAC ACTT CCAAAGCT GCCAACTT CAGGAACTTTACCTTCATC 

3 .. \ CAGCTGAATGGAGAATTTTCTCGGGGAAAGGGACTTGATGTTGGAGCCCGCTTCTGGAAGGGAAGCAACGTCCTTCTC 
JJ? TTTTTCTGTGATGTGGACATCTACTTCACATCTGAATTCCTCAATACGTGTAGGCTGAATACACAGCCAGGGAAGAAG 
p|| GTATTTTATCCAGTTCTTTTCAGTCAGTACAATCCTGGCATAATATACGGCCACCATGATGCAGTCCCTCCCTTGGAA 

CAGCAGCT GGT CAT AAAGAAGGAAACTGGAT TTT GGAGAGACTTT GGATT T GGGAT GACGT GT CAGTAT CGGTCAGAC 
T TCAT CAATAT AGGT G GGTT T GATCT GGACAT CAAAGGCT GGGGC G GAGAGGAT GT GCAC CTT TAT CG CAAGTAT CTC 
IJI CACAGCAACCT CATAGT GGT ACGGAC GCCT GT G CGAGGACT CT T C CAC CT C T GGCATGAGAAG C GCTG CAT GGACGAG 

| K CTGACCCC CGAGCAGTACAAGAT GTG CAT GCAGT CCAAGGC CAT GAAC GAGGCAT CCCACGGCCAGCT GGGCAT GCTG 

GT GTT CAGGC ACGAGATAGAGGCT C ACCT T CGCAAACAGAAACAGAAGACAAGTAG CAAAAAAACATGAACTC C CAGA 
GAAGGATT GT GGGAGACACTTTTT CTTT C CTTT T GCAATTACT GAAAGT GGCT GCAAC AGAGAAAAGACTT CCATAAA 
GGACGACAAAAGAATT GGACT GAT GGGT CAGAGAT GAGAAAGC CT C CGATT TCT CT CT GTT GG GCTTTT TACAACAGA 
AAT CAAAAT CT CC GCT TTGCCTGCAAAAGTAACC CAGT T GCAC CCT GT GAAGT GT CTGACAAAGGCAGAAT GCT T GTG 
AGAT TATAAGC CTAAT GGT GT GGAGGTTTTGAT GGT GT T T ACAATACACT GAGACCTGTT GTTT T GT GT GCTCATT GA 
AATATT CAT GATTTAAGAGCAGTT TT GTAAAAAAT T CATT AGCAT GAAAGGCAAGCAT AT T T C T CCT CATAT GAAT GA 
GCC T AT CAGC AGGGCT CTAGTTT CTAGGAAT GCT AAAATAT CAGAAGGCAGGAGAGGAGATAGGCTTATT ATGATACT 
AGT GAGTACATTAAGT AAAATAAAAT GGACCAGAAAAGAAAAGAAACCATAAAT AT CGT GT CAT ATTT T CCCCAAGAT 
TAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAATGCACT 
TTTTTTCCCTTGT GAGTTATAGT CT GCTTATTT AAT TACCACTT T GCAAGCCT TACAAGAGAGCACAAGTT GGCCTAC 
ATTTTTATATTTTTTAAGAAGATACTTTGAGATGCATTATGAGAACTTTCAGTTCAAAGCATCAAATTGATGCCATAT 
CCAAGGACATGCCAAAT GCT GATT CT GT CAGGCACT GAAT GT CAGGCATT GAGACATAGGGAAGGAAT GGTTTGTACT 
AAT ACAGACGTACAGATACTTT CT CT GAAGAGTATT TT C GAAGAGGAGCM.CT GAACAC TGGAGGAAAAGAAAATGAC 
ACT TT CTGCTT TACAGAAAAGGAAACT CATT CAGAC T GGT GATAT CGT GAT GTACC T AAAAGT CAGAAACCACATTTT 
CTCCT CAGAAGTAG GGAC CGC TTT C TTACCT GT TTAAA.TAAAC CAAAGTATACC GT GT GAAC CAAACAAT CT CT TTTC 
AAAACAGGGTGCTCCTCCTGGCTTCTGGCTTCCATAAGAAGAAATGGAGAAAAATATATATATATATATATATATTGT 
GAAAGATCAATCCATCTGCCAGAATCTAGTGGGATGGAAGTTTTTGCTACATGTTATCCACCCCAGGCCAGGTGGAAG 
TAACT GAATTATTT TT TAAAT TAAGC AGT T CTACTCAAT CACC AAGAT GCTTCT GAAAATT GCATT T TATTACCATTT 
CAAACTATT TT TTAAAAAT AAATACAGT TAACAT AGAGT GGTTT C TTCATT CAT GT GAAAATT ATTAGCCAGCACCAG 
AT GCAT GAGCT AAT TATCT CT TT GAGT C CTT GCTT CTGTT T GCT CACAGTAAACT CATT GTTT AAAAGCTT CAAGAAC 
ATTCAAGCTGTTGGTGTGTTAAAAAATGCATTGTATTGATTTGTACTGGTAGTTTATGAAATTTAATTAAAACACAGG 
CCAT GAAT GGAAG GT GGT AT T GCACAGCTAATAAAATAT GATTT GTGGATATGAA 
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FIGURE 72 



MMMVRRGLLAWISRVWLLVLLCCAI SVLYMLACTPKGDEEQLALPRANSPTGKEGYQAVLQEWE 
EQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDK 
AEVNAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPA 
ENS PNHRP YTAS DF I EG I YRTERDKG T L YE LT FKGDHKHE FKRL I L FRP FS P IMKVKNEKLNMAN 
TLINVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTWYFGKEEINEVKGILENTSKAANFRNF 
TFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQY 
NPG 1 1 YGHHDAVPPLEQQLVI KKE TG FWRDFGFGMT CQYRS DFI N I GG FDLD I KGWGGEDVHL YR 
KYLHSNLIWRTPTOGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAH 



RKQKQKTS SKKT 




Important features: 
Signal peptide: 

amino acids 1-27 




N-glycosylation sites. 

amino acids 315-319, 324-328 



N-myristoylation sites . 

amino acids 96-102, 136-142, 212-218, ; 311-317, 339-345, 393-399 



Amidation site. 



amino acids 377-381 
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FIGURE 73 



GAGACTGCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGATTTGTCCTGGGGATCCA 
GAAACCCAT GATACCCTAC TGAACACCGAATCCCCTGGAAGCCCACAGAGACAGAGACAGCAAGA 
GAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTCTCACTCCTC 
CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCGTTG 
CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GTCAACACTGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCAGCCTCTTACCCTGAGTGT 
GGAAACAATGCCCAGTCGCCCATCGATATTCAGACAGACAGTGTGACATTTGACCCTGATTTGCC 
jU TGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGCCTTTGGACCTGCACAACAATGGCC 
CI ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 

•5:;;? I 

w 



GCCCAGCTCCACCTGCACTGGGGTCAGAAAGGATCCCCAGGGGGGTCAGAACACCAGATCAACAG 
TGAAGCCACATTTGCAGAGCTCCACATTGTACATTATGACTCTGATTCCTATGACAGCTTGAGTG 
AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 
AATATAGGTTATGAACACATTCTGAGTCACTTGCATGAAGTCAGGCATAAAGATCAGAAGACCTC 
AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATG 
GCTCGCTCACAACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG 
ATTTCAATGGAACAGGTGGAAAAGCTTCAGGGGACATTGTTCTCCACAGAAGAGGAGCCCTCTAA 
El GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 

TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 
TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 
CCGAAAGAGTGTGGTCTTCACCTCAGCACAAGCCACGACTGAGGCATAAATTCCTTCTCAGATAC 
CATGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTGG 
CCAGAAACACTG TAGGAG TAG TAAGCAGAT GT CC TCC T TCCCCTGGACAT C T C T TAGAGAGGAAT 
GGAC CCAGGCTG TCAT TC CAGGAAGAACT GCAGAGCCT TCAGCC T CT CCAAACAT GTAGGAGGAA 
ATGAGGAAATCGCTGTGTTGTTAATGCAGAGANCAAACTCTGTTTAGTTGCAGGGGAAGTTTGGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
C TCC T T AGGATAAAGAGT T GC T GT T GAAGT TGTATAT T T T T GAT CAAT AT AT T TGGAAAT TAAAG 
TTTCTGACTTT 
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FIGURE 74 



MLFSALLLEVIWILAADGGQHWTYEGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQ 
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QNYRALQPLNQRMVFASFIQAGSSYTTGEMLSLGVGILVGCLCLLLAVYFIARKIRKKRLENRKS 
WFT SAQAT TEA 

Important features of the protein: 




Signal peptide: 



amino acids 1-15 



Transmembrane domain: 



amino acids 291-310 




N-glycosylation site. 



amino acids 213-216 



Eukaryotic-type carbonic anhydrases proteins 



amino acids 197-245, 104-140, 22-69 
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FIGURE 25 



TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 
TCTGGTGGTTTGCCTAAACCTGCAAACATCACCTTCTTATCCATCAACATGAAGAA.TGTCCTACA 
AT GGACT CCACCAGAGGGT CT T CAAGGAGT T AAAGT TAC T TACACTGTGCAGTAT TTCATCACAA 
ATTGGCCCACCAGAGGTGGCACTGACTACAGATGAGAAGTCCATTTCTGTTGTCCTGACAGCTCC 
AGAGAAGTGGAAGAGAAATCCAGAAGACCTTCCTGTTTCCATGCAACAAATATACTCCAATCTGA 
AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 
ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 
AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 
CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 
TCTGTGATGGGCTATTCCATCTACCGATATATCCACGTTGGCAAAGAGAAACACCCAGCAAATTT 
GATTTTGATTTATGGAAATGAATTTGACAAAAGATTCTTTGTGCCTGCTGAAAAAATCGTGATTA 
%J ACTTTATCACCCTCAATATCTCGGATGATTCTAAAATTTCTCATCAGGATATGAGTTTACTGGGA 
LlJ AAAAGCAGTGATGTATCCAGCCTTAATGATCCTCAGCCCAGCGGGAACCTGAGGCCCCCTCAGGA 

GGAAGAGGAGGTGAAACATTTAGGGTATGCTTCGCATTTGATGGAAATTTTTTGTGACTCTGAAG 
AAAACACGGAAGGTACT T C TC T CAC CCAGCAAGAGTCCC TCAGCAGAACAAT ACCCCCGGATAAA 
ACAGT CAT T GAATAT GAATAT GATG T CAGAACCAC TGACAT TT GT GC GGGGCCTGAAGAGCAGGA 
Q GCT CAGT T T GCAGGAGGAGGT GT CCACACAAGGAACAT T AT TGGAG T CGCAGGCAGC GT TGGCAG 

||| T CT TGGGGCCGCAAACGT TACAGTAC T CATACACCCC TCAGCT C CAAGAC T T AGACCCCC T GGCG 

M CAG GAGCACACAGAC TCGGAGGAGGGGC C GGAGGAAGAGCCAT CGACGAC CCTGG TCGACT GGGA 



TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 
AGCCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCTTCTATCTAGACTCTATGAGGAGCCGGCT 
C CAGACAGGCCACCAGGAGAAAATGAAACC TATC TCATGCAAT T CAT GGAGGAATGGGGGT TAT A 
TGTGCAGATGGAAAACTGATGCCAACACTTCCTTTTGCCTTTTGTTTCCTGTGCAAACAAGTGAG 
TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 
CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 
GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 
AGCAGTCAATAAATGTTTGCCAGACTGGGTGCAGAATTTATTCAGGTGGGTGT 



h 

SB? 

m 
w 



.pi?. 
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FIGURE 76 



MSYNGLHQRVFKELKLLTLCSISSQIGPPEVALTTDEKSISWLTAPSKWKRNPEDLPVSMQQIY 
SNLKYNVSVLNTKSNRTWSQCVTNHTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQCARTLK 
DQS SE FKAKI I FWYVLP I S I TVFLFS VMG YS I YRY I HVGKEKHPANL I L I YGNE FDKRFFVPAEK 
IVINFITLNISDDSKISHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLMEIFC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 




Important features: 
Signal peptide: 



amino acids 1-28 



m 



Transmembrane domain: 



amino acids 140-163 



m 



N-glycosylation sites. 




amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
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FIGURE 77 



GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAG ATG GCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGAAAAGCTGAC 
ACAGGAGCTGAAGGACCACAACGCCACCAGCATCCTGCAGCAGCTGCCGCTGCTCAGTGCCATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
ATCTGGCTGAAGGTCATCACAGCTAACATCCTCCAGCTGCAGGTGAAGCCCTCGGCCAATGACCA 
GGAGC T GC TAGT CAAGAT CCCC C T G GACAT GGT GGC TGGAT T C AACACGC CCC TGGT CAAGAC CA 
H TCGTGGAGTTCCACATGACGACTGAGGCCCAAGCCACCATCCGCATGGACACCAGTGCAAGTGGC 

CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACGTCCTAGTGCCATCCCTGC 
CCAATCTAGTGAAAAACCAGCTGTGTCCCGTGATCGAGGCTTCCTTCAATGGCATGTATGCAGAC 



%l CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
W TCCTGCCATCAAGGGTGACACCATTCAGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAA 



???r AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCG 

TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
If! . ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 



T GAT CAAT GAAAAGGC TGCAGATAAGC TGGGAT CTACCCAGAT CGTGAAGATCCTAACT CAGGAC 
ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCGCAACTGATCGTGCTGGAAGTGTT 
TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACGCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 
TTTACACCAAAGGTGACCAACTTATACTCAACTTGAATAACATCAGCTCTGATCGGATCCAGCTG 
ATGAACTCTGGGATTGGCTGGTTCCAACCTGATGTTCTGAAAAACATCATCACTGAGATCATCCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAGGGAAGGCTGG 
GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 
CCTGTGAAAAA 
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MAGPWTFTLLCGLLAATLIQATLSPTAVLILGPKVIKSKLTQELKDHNATSILQQLPLLSAMREK 
PAGGI PVLGSLVNTVLKHI IWLKVI TANI LQLQVKPSANDQELLVKI PLDMVAGFNT PLVKT I VE 
FHMTTEAQAT I RMDTSASGPTRLVLSDCATSHGSLRI QLLYKLS FLVNALAKQVMNLLVPSLPNL 
VKNQLCPVI EAS FNGMYADLLQLVKVPI S LS I DRLE FDLL YPAIKGDT I QLYLGAKLLDSQGKVT 
KWFNNSAASLTMPTLDNIPFSLIVSQDVVKAAVAAVLSPEEFMVLLDSVLPESAHRLKSSIGLIN 
EKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 
KGDQL I LNLNMI S S DRI QLMNS G I GWFQPDVLKNI I TE 1 1 HS I LLPNQNGKLRSGVPVS LVKALG 



FE AAE S S L TKDALVL T PAS LWKP S S PVS Q 



6 



Important features of the protein: 



Signal peptide: 



amino acids 1-21 




N-glycosylation sites. 

amino acids 48-51, 264-267, 401-404 



Glycosaminoglycan attachment site. 



amino acids 412-415 



LBP / BPI / CETP family proteins. 



amino acids 407-457 
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FIGURE 7Q 



GAGAGAAGT CAGCC TGGCAGAGAGAC TCTGAAATGAGGGATT AGAGGT GT T CAAGGAGCAAGAGC 
TTCAGCCTGAAGACAAGGGAGCAGTCCCTGAAGACGCTTCTACTGAGAGGTCTGCCATGGCCTCT 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
CCAAGGGCCTCTGGATGGAATGTGCCACACACAGCACAGGCATCACCCAGTGTGACATCTATAGC 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 
GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC 
ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 

m CATGAAATTTGAGATTGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAG 

u f ■ 

Ul CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC 

H- CAAGCCCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA 

is .. ; 

GTTCAATTCCTACAGCCTGACAGGGTATGTGTGAAGAACCAGGGGCCAGAGCTGGGGGGTGGCTG 

GGTCTGTGAAAAACAGTGGACAGCACCCCGAGGGCCACAGGTGAGGGACACTACCACTGGATCGT 

pi- 

m GTCAGAAGGTGC T GC TGAGGATAGAC T GAC TT TGGCCATT GGAT T GAGCAAAGGCAGAAATGGGG 

Ul GCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 

TCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 
GGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACA 
TCCCACTGACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 
GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC 
CCTCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCA 
GACTAAT T TGTGCATGAACTGAAATAAAACCATCCTACGGTAT CCAGGGAACAGAAAGCAGGATG 
C AGGAT GGGAGGAC AGGAAGGC AGC C T GGGAC AT T T AAAAAAAT A 



:-. z 



CO 
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MASLGLQLVGYILGLLGLLGTLVAMLLPSWKTSSYVGASIVTAVGFSKGLWMECATHSTGITQCD 
I YS T LLGLPADI QAAQAMMVT S SAI S SLAC 1 1 S WGMRC TVFC QS S RAKDRVAVAGGVFF I LGGL 
LGFI PVAWNLHG I LRDFYS PLVPDSMKFE I GEALYLG IIS SL FS LI AG 1 1 LCFSCS S QRNRSNYY 
DAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 

Important features of the protein: 
Signal peptide: 



amino acids 1-24 



m 



Transmembrane domains: 



amino acids 82-102, 117-140, 163-182 




N-glycosylation site. 



amino acids 190-193 




PMP-22 / EMP / MP20 family proteins. 

amino acids 4 6-59 
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FIGURE 81 



CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 
CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 
GCCCACCGCTGCTTCCTGGCCCTTCTCGGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 
GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 
ACCAGCGGCCTGACCCTGGGGAAAGGATGGTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 
CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 
CCATGGGAAGAGATACTCCCCCGGCGAGAGCTGGCACCCCTACTTGGAGCCACAAGGCCTGATGT 
|=4 ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 
W GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 

TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 
AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 
%l ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC 
W AC T GCCAGACT CC TGC T GCCAAGCC T GCAAAGAT GAGGC AAGT GAGCAAT CGGAT GAAGAGGACA 

GTGTGCAGTCGCTCCATGGGGTGAGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG 
AGAGGCCCGGGCACCCGAGCCCCCACTGGCGTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 



Hi 



PI 

m CAGACCCAAGGGAGCAGGCAGCACAACT G T CAAGATC GT CCTGAAGGAGAAACATAAGAAAGCCT 



GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 
03 CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 
W CACCGAGTACCCGTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 

ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCGGGGTC 
CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 
C T CGGAC T T GGTGGAGATCTACCTC TGGAAGCTGGTAAAAGAT GAGGAAACT GAGGCTCAGAGAG 
GT GAAGT ACCT GGCCCAAGGCCACACAGC CAGAATC T TCCACT T GACT CAGATCAAGAAAG TCAG 
GAAGCAAGACTTCCAGAAAGAGGCACAGCACTTCCGACTGCTCGCTGGCCCCCACGAAGGTCACT 
GGAACGTCTTCCTAGCCCAGACCCTGGAGCTGAAGGTCACGGCCAGTCCAGACAAAGTGACCAAG 
ACAT AACAAAGACC TAACAG T T GCAGATAT GAGC T GTATAAT T G TT GT TAT T ATATAT TAATAAA 
TAAGAAG T T GCAT TAC CC T CAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 82 

MVPEYRVLSSLLGLALLWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLRCTCSE 
GAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIFSAHELF 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
ISSTRCPKAPGRVLVHTSVSPSPDNLRRFALEHEASDLVEIYLWKLVKDEETEAQRGEVPGPRPH 
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 

Signal peptide: 

amino acids 1-25 
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FIGURE 83 



Ci 
o 

5 :: s 



Ill 



o 



GACAGCTGTGTCTCGATGGAGTAGACTCTCAGAACAGCGCAGTTTGCCCTCCGCTCACGCAGAGCCTCTCC 
GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 
TCCGTTTCCATGCCGTGAGGTCCATTCACAGAACACATCC ATG GCTCTCATGCTCAGTTTGGTTCTGAGTC 
TCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAG 
GACGC AGC AT TCTC CTGTTTCCTGTCTCCTAAGACCAATGC AG AGGCC AT GGAAGTGCGGTTCTTC AG GGG 
C CAGTT CTC T AGCGT GGT CC ACCT CT AC AG GGACGG GAAGGACC AGCC AT TT AT GCAG AT GCC AC AGT AT C 
AAGGC AGG AC AAAACT GGT G AAGGAT TCT ATT GCGGAGGGGCGC AT CT CT CTGAGGCTGG AAAAC AT T AC T 
GTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTCTTACTACCAGAAGGCCATCTGGGAGCT 
AC AGGT GT CAGC ACTGGGCTCAGT T C CTCT C AT TTCCATCACGGGAT ATGTT GAT AGAG AC AT CCAGC T AC 
TCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCC 
AC AG AC TCCAGG AC AAAC AGAGAC AT GC AT GG CCT GTT TGAT GT GGAG AT CTCT CT GACCGT C C AAGAGAA 
CGC C GGGAGC AT AT CCTGTT CC AT GC GGCAT G CT CATCT GAGCC GAGAGGTGGAAT CC AGGGT ACAGAT AG 
GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 



|y TTTTTTGGCATTGTTGGACTGAAGATTTTCTTCTCCAAATTCCAGTGGAAAATCCAGGCGGAACTGGACTG 

GAGAAG AAAGC ACGGAC AGGCAGAATTGAG AG AC GC CC GG AAAC ACGC AGT GG AGGT G ACTCT GGATC C AG 



W- AGACGG CTCACC CG AAGCTC TGCGT T TC TGAT CT GAAAACT G TAACCC AT AG AAAAGCT C CC C AGGAGGT G 

W CCTC AC TCTGAG AAGAGATT TACAAGGAAGAGT GTGGT GGCT TCTC AG AGT TT C CAAGCAGGGAAACATT A 



CT GGGAGGT GGACGGAGGAC AC AAT AAAAGGT GGCGCGTGGGAG TGT GCCGGGATGAT GT GGAC AGGAGGA 
AGGAGTACGTGACTTTGTCTCCCGATCATGGGTACTGGGTCCTCAGACTGAATGGAGAACATTTGTATTTC 
ACATTAAATCCCCGTTTTATCAGCGTCTTCCCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTA 
TGAGTGTGGGACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGTCGGTTTG 
AAGG CT TAT T GAGGCC CT AC AT TG AGT ATCCGT CC T AT AATG AGCAAAAT GGAACTCCC AT AGTC ATC TGC 
CCAGTCACCCAGGAATCAGAGAAAGAGGCCTCTTGGCAAAGGGCCTCTGCAATCCCAGAGACAAGCAACAG 
TG AGTC CTCC TC AC AGGC AACC ACGCCCTT CCTCCC CAGGGGT GAAAT G TAGG AT G AATC AC ATCCCACAT 
TCTTCTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGCAGCCGGCCAAGGTGGCTTCCA 
GATGAAGGGGGACTGGCCT GT C CAC ATGGG AGT C AGGTGT CATGGCTGCCCT GAGC T GGGAGGGAAGAAGG 
CTGACATT AC AT TT AGTT T GC T CT C ACTCC AT CT GGCT AAGT G ATC TT GAAAT ACC ACCT CT CAGGTGAAG 
AACCGT C AGG AATT CCCAT CT C AC AGGCTG TGGT GT AGATT AAGT AGACAAGGAAT GTGAAT AATGCT TAG 
ATCT TATTGATGACAGAGTGTATCCT AATGGT TTGTTCAT TATATTACACTTTC AGTAAAAAAA 



APP JD=1 0063732 Page 237 of 322 



FIGURE 8a 



MALMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLSPKTNAEAMEVRFFRGQFSSWH 
LYRDGKDQPFMQMPQYQGRTKLVKDS IAEGRISLRLENITVLDAGLYGCRISSQSYYQKAIWELQ 
VSALGSVPL I S I TGYVDRDIQLLCQS SGW FPRP TAKWKGPQGQDLS TDSRTNRDMHGLFDVE I SL 
TVQENAGS I SCSMRHAHLSREVESRVQI GDT FFEP I SWHLATKVLG I LCCGLFFGIVGLKI FFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQS FQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKE WTLS PDHGYWVLRLNGEHLYFT 
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 
PIVICPVTQESEKEASWQRASAIPETSNSESSSQATTPFLPRGEM 




Signal peptide: 



amino acids 1-17 



Transmenjbrane domain: 



amino acids 239-255 
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FIGURE 8s 



AACAGACGTTCCCTCGCGGCCCTGGCACCTCTAACCCCAGAC ATGC TGCTGCTGCTGCTGCCCCT 
GCTCTGGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAACTGCTGACGATGCAGAGTTCCGTGA 
CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 
TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 
AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 
GGGACCC ACAT ACC AAGAAT T GCAC CC TG AG CATC AGAGAT GCC AGAAGAAGT GAT GCGGGGAGA 
TAC T TCT T TCGTAT GGAGAAAGGAAGT AT AAAATGGAAT TATAAACAT CACCGGCTCT CT GT GAA 
•U TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 
CI AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 

GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 



7. - : 

CP •• 



SI 



CI 

:: iivt 



GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 
ACAAGACCGTCCATCTCAACGTGTCCTACCCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGA 



|j GACGGCACAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 
1l GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 

GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 
CTGAGGGATGCAGCTGAATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCT 



m 

fr% GAACGTC T CCC T GCAGAGCAAAGCCACAT CAGGAGT GAC T CAGGGGG T GG T C GGGGGAGCT GGAG 



CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 
CI TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 

m 

AGCCTCTCAGGGGCCCCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAG 
CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 
AAGCCTTGGGACTCGCGGGGACAGGAGGCCACTGACACCGAGTACTCGGAGATCAAGATCCACAG 
ATGAGAAACTGCAGAGACTCACCCTGATTGAGGGATCACAGCCCCTCCAGGCAAGGGAGAAGTCA 
GAGGCTGATTCTTGTAGAATTAACAGCCCTCAACGTGATGAGCTATGATAACACTATGAATTATG 
T GCAGAGT GAAAAGCACACAGGCTT TAGAGT CAAAGTAT C T CAAAC C T GAAT CC ACAC TGTGCCC 
T C C C T T T TAT T T T T T T AAC T AAAAG AC AG AC AAAT T C C T A 
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FIGURE 86 

MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPVVHGYW'FREG 
ANTDQDAPVATNNPARAVWEETRDRFHLLGDPHTKNCTLS IRDARRS DAGRY FFRMEKGS I KWNY 
KHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
SVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNLTMTVFQGDGTVSTVLGNGSSL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGSQQVYLNVSLQSKATSGVTQGWGGAGATALVFLS FCVI FVWRSCRKKSARPAAGVGDTGIE 
DANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE 
M YSEIKIHR 



M 



HI 



f? 

II! 



Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 351-370 
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FIGURE 87 



AGAAAGCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTGTAC 
CCAAGGAAAGTGCAGCTGAGACTCAGACAAGATTACA ATGA ACCAACTCAGCTTCCTGCTGTTTC 
TCATAGCGACCACCAGAGGATGGAGT ACAGATGAGGC TAATAC T T AC TTCAAGGAATGGACC TG T 
TCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCCTAGTGCATTTGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC 
CAACTACAACACCTTTGGATCTGCAGAGGCGGCCACGAGCGATGACTACAAGAACCCTGGCTACT 
£3 ACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTGCCCAATAAGTCCCCCATGCAGCACTGG 
M AGAAACAGCTCCCTGCTGAGGTACCGCACGGACACTGGCTTCCTCCAGACACTGGGACATAATCT 
J*! G TT T GGCAT C TACCAGAAATAT CCAGTGAAATATGGAGAAGGAAAG TGTT GGAC TGACAACGGCC 

7 l { CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCCAGAAAACAGCATCTTATTACTCACCCTAT 
h] GGCCAGCGGGAATTCACTGCGGGATTTGTTCAGTTCAGGGTATTTAATAACGAGAGAGCAGCCAA 
Ill CGCCTTGTGTGCTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG 

GATACTTTCCAGAGGCCAGTCCCCAGCAGTGTGGAGATTTTTCTGGTTTTGATTGGAGTGGATAT 
GGAACTCATGTTGGTTACAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCG 
T TGAGAG T T T TGTGGGAGGGAACCCAGACCTC T C CT CCCAACCAT GAGATCCCAAGGAT GGAGAA 



m 
ifl 

385' ? 

Of . ' CAAC T TACCCAGTAGCTAGAAT GT T AAT GGC AGAAGAGAAAACAATAAATCATAT TGAC T CAAGA 

O AAAAAA 

jFH 5 
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FIGURE 88 



MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTENGVI 
YQT FCDMTSGGGGWTLVAS VHENDMRGKCT VGDRWS S'QQGSKADYPEGDGNWANYNT FGSAEAAT 
SDDYKNPGYYDIQAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLGHNLFGIYQKYPVKYG 
EGKCWT DNGP VI PWYDFGDAQKT AS Y YS PYGQRE FTAGFVQ FRVFNNERAANALCAGMRVT GCN 
TEHHCIGGGGYFPEASPQQCGDFSGFDWSGYGTHVGYSSSREITEAAVLLFYR 

Important features: 
Signal peptide: 




N-glycosylation site. 



amino acids 1-16 



amino acids 163-167 




Glycosaminoglycan attachment sites. 

amino acids 74-78, 289-293 



N-myristoylation sites. 



amino acids 76-82, 115-121, 124-130, 253-259, 292-298 
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FIGURE 8q 



CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCCATGGGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
fa TCACTGAAATGGCTTTATTCGTCACCGTCTTTGGGCTGAAAAAGAAACCCTTCTGATTAGCTTCA 
W TGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGGCCGCTTCGTATTCCTGGAAGAAGGAAG 
p GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 

71 T CT T GAGT CTGGGAT TAT CCGCATT GTAT T TAGTGC T T TGTAAT AAAATAT GT T T TG TAGTAACA 

pi 

KJ T T AAGAC T T ATATACAGT T T TAGGGGACAATT AAAAAAAAAAAA 

f't 
II! 

m ■ 

rt 

Ill 
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FIGURE go 

MGRVSGLVPSRFLTLLAHLWVITLFWSRDSNIQACLPLTFTPSEYDKQDIQLVAALSVTLGLFA 
VSLAGFLSGVSMFNSTQSLISIGAHCSASVALSFFIFERWEGTTYWYIFVFCSALPAVTEMALFV 
TVFGLKKKPF 

Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 
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FIGURE qi 



CT GGGACCCC GAAAAGAGAAGGGGAGAGCGAGGGGACGAGAGC GGAGGAGGAAGATGCAAC T GAC 
TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 
GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 
CACCCTCAGCCAAGGTGAAGAAAATCTTTGGCTGGGGCGACTTCTACTCCAACATCAAGACGGTG 
GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 
hk CCAACACAATGCCACAGGCCAGGGAAACATCTCCATCAGCCTCGTGCCCCCCAGTAAAGCTGTAG 

AGTTCCACCAGGAACAGCAGATCTTCATCGAAGCCAAGGCCTCCAAAATCTTCAACTGCCGGATG 
GAGTGGGAGAAGGTAGAACGGGGCCGCCGGACCTCGGTTTGCACCCACGACCCAGCCAAGATCTG 
CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 
TCTACAT C GCC T TC TACAGCAC GGAC T AT CGGC T GGT CCAGAAGG T G T GC C CAGAT TACAAC TAC 



iw-l 



m 



hi CATAGTGATACCCCCTACTACCCATCTGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGC 



y TGGAAGGACAGGCGTGCCCATGCAGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTC 

AGGCAGGGAGGGGGGTGGAGACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCTCAAGTGGCAG 
AGAAAGGGTCCCAAGTGCTGGTCCCAACCTGAAGCTGTGGAGTGACTAGATCACAGGAGCACTGG 



jS 



|3 AGGAGGAGTGGGC T C T C TGTGCAGC C TCACAGGGCT T TGC CAC GGAGCCACAGAGAGATGC T GGG 

Bj . TCCCCGAGGCCTGTGGGCAGGCGGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 
N r CCTTGGTTCT T GCCATCC T GAGGAAAG AT AGCAAC AGGGAGGGGGAGAT T T CATCAGTG TGGACA 

ru 

GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 
GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 
CATGAGGCTGAA.GTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 
CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 
GCACACCCATCCTTAAGCTAAGACAGGACGATTGTGGTCCTCCCACACTAAGGCCACAGCCCATC 
CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 
GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 
CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 
TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 
TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 
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FIGURE 02 



MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPMANSTL 
LGLLAPPGEAWGI LGQPPNRPNHSPPPSAKVKKI FGWGDFYSNI KTVALNLLVTGKI VDHGNGT F 
SVHFQHNATGQGNI S ISLVPPSKAVEFHQEQQI FIEAKASKI FNCRMEWEKVERGRRTSLCTHDP 
AKICSRDHAQSSATWSCSQPFKWCVYIAFYSTDYRLVQKVCPDYNYHSDTPYYPSG 

Important features of the protein: 
Signal peptide: 




amino acids 1-14 




N-glycosylation sites. 



amino acids 62-65, 127-130, 137-140, 143-146 




2-oxo acid dehydrogenases acyl transferase 



amino acids 61-71 
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FIGURE Q3 

CGGTGGCCATGA.CTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAG AT G G AC C AAC AC AGAAAT AT C T GC T GAT C T T T G GAGC GT TTGTCTCTGTC TAT AT C C AAG AA 
ATG T T CC GAT T TGCATAT T ATAAAC T CT TAAAAAAAGCCAGT GAAGGTT T GAAGAGTATAAACCC 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 
GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT 
ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
TCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG 
TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 
AAGCCTGAAACTCTGCCTGCTCTGCCAAGACAAGAACTTTCTTCTTTACAACCAGCGCTCCAGAT 
AACCTCAGGGAACCAGCAC T T CCCAAACCGCAGAC TACATC T T TAGAGGAAGCACAAC TGTGCCT 
T TT T C TGAAAATCCC T TTTTCTGGTG GAAT TGAGAAAGAAATAAAACT AT GCAGATA 
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FIGURE Q4. 



MTAAVFFGCAFIAFGPALALYVFT I AI E PLRI I FL I AGAFFWLVSLL I S S LVW FMARVI I DNKDG 
PTQKYLLIFGAFVSVYIQSMFRFAYYKLLKKASEGLKSINPGETAPSMRLLAYVSGLGFGIMSGV 
FSFVNTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKWGILLIVLLT 
HLLVSAQTFISSYYGINLASAFIILVLMGTWAFLAAGGSCRSLKLCLLCQDKNFLLYNQRSR 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Transmembrane domains: 



14 
h 

ft amino acids 32-51, 119-138, 152-169, 216-235 

IAI 



Glycosaminoglycan attachment site. 



|];| amino acids 120-123 



Sodium: neurotransmitter symporter family protein 

amino acids 31-65 
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FIGURE 

AAT T T T TCACCAGAGTAAAC T TGAGAAACCAAC TGGACC T T GAGTAT T GT ACAT T T T GCCT CGT G 
GACCCAAAGGTAGCAATCTGAAACATGAGGAGTACGATTCTACTGTTTTGTCTTCTAGGATCAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
GAACACTACCAAACCAACAGCAGTCAAATCAGGTCTTTCCTTCTTTAAGTCTGATACCATTAACA 
CAGATGCTCACACTGGGGCCAGATCTGCATCTGTTAAATCCTGCTGCAGGAATGACAGCTGGTAC 
CCAGACCCACCCATTGACCCTGGGAGGGTTGAATGTACAACAGCAACTGCACCCACATGTGTTAC 
CAAT TT T T GT CACACAAC T TGGAGC CCAGGGCACTATCC TAAGCT CAGAGGAAT TGCCACAAAT C 
TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 
TAATCCAGATGTCCAGGATGGAAGCCTTCCAGCAGGAGGAGCAGGTGTAAATCCTGCCACCCAGG 
GAACCCCAGCAGGCCGCCTCCCAACTCCCAGTGGCACAGATGACGACTTTGCAGTGACCACCCCT 
GCAGGCATCCAAAGGAGCACACATGCCATCGAGGAAGCCACCACAGAATCAGCAAATGGAATTCA 
G TAAG CTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 
AT T GAT TATAT TATGGAATAGAT TGAGACACAT TGGATAGTCT TAGAAGAAAT TAATTCT TAATT 
TACCTGAAAATATTCTTGAAATTTCAGAAAATATGTTCTATGTAGAGAATCCCAACTTTTAAAAA 
CAATAATTCAATGGATAAATCTGTCTTTGAAATATAACATTATGCTGCCTGGATGATATGCATAT 
T AAAAC AT AT T T G G AAAAC T G G AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE q6 



MRSTILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTL-PNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLP I FVTQLGAQGT ILS SEE 
LPQIFTSLIIHSLFPGGILPTSQAGANPDVQDGSLPAGGAGWPATQGTPAGRLPTPSG 
TDDDFAVTTPAGIQRSTHAIEEATTESANGIQ 

Signal peptide: 

amino acids 1-16 
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FIGURE Q7 



GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 
CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACCATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 
AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGC 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 
GCAGGCAGAGTACCAGCTACAGGTCACCCTGGAGATGCAGGATGGACATGTCTTGTGGGGTCCAC 
AGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTGCCCCATTTCTCTCAAGCCATCTAC 
AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 
GGAT GAGCCAGGCACAGCCAAC T CGGATCTT CGAT T CCACATCC TGAGCCAGGC T CCAGCCCAGC 
CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 
ACCAGCC T T GACCACGCCCTGGAGAGGACC TACCAGC T G T T GGT ACAGGT CAAGGACATGGGTGA 
CCAGGCCTCAGGCCACCAGGCCACTGCCACCGTGGAAGTCTCCATCATAGAGAGCACCTGGGTGT 
CCCTAGAGCCTATCCACCTGGCAGAGAATCTCAAAGTCCTATACCCGCACCACATGGCCCAGGTA 
#11 CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 

TGCAGAGGGAAACC TC TACGTGACCAGAGAGC T GGACAGAGAAGCC CAGGCT GAGTACCT GC TCC 



til 



AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 
H ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT 
y CAGTCCACCAGGTACTGAAGTGACTAGACTGTCAGCAGAGGATGCAGATGCCCCCGGCTCCCCCA 
JfJ ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 

CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 
*. CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 
U TCGAAGTCGCAGTCACAGATATCAATGATCACGCCCCTGAGTTCATCACTTCCCAGATTGGGCCT 
III ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 
f s s CCTCGAGCCCGCCTTCCGCCTCATGGATTTTGCCATTGAGAGGGGAGACACAGAAGGGACTTTTG 

GCCTGGATTGGGAGCCAGACTCTGGGCATGTTAGACTCAGACTCTGCAAGAACCTCAGTTATGAG 
5;- GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 
Q AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 
HI ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 

CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 
CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 
ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGAGTCTTGCCCCTGTGCCCTCCCAA 
TACCTCTGCACACCCCGCCAAGACCATGGCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCT 
GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 
CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 
AT T T T CACCCAC TGGACCATGT CAAGGAAGAAGGACCCGGATCAAC CAGCAGACAGCGTGC CCCT 
GAAGGCGACTGTC TGAA TGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 
TCCCCTGGGAGAGAGCCCAGCACCCAAGATCCAGCAGGGGACAGGACAGAGTAGAAGCCCCTCCA 
TCTGCCCTGGGGTGGAGGCACCATCACCATCACCAGGCATGTCTGCAGAGCCTGGACACCAACTT 
TATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCAGAGAA 
CTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE q8 



MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 
EGPFAMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPG I P FL FLEAS DRDE PGTANS DLRFH I LS QAPAQPS PDMFQLE PRLGALALS PKG 
S T SLDHALERT YQLLVQVKDMGDQAS GHQATATVEVS I 1 E S TWVSLEP I HLAENLKVL YPHHMAQ 
VHWSGGDVHYHLESHPPGPFEVNAEGNLYVTRELDREAQAEYLLQVRAQNSHGEDYAAPLELHVL 
VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHWYQLLSPEPEDGVEGRA 
FQVDPTSGSVTLGVLPLRAGQNILLLVLAMDLAGAEGGFSSTCEVEVAVTDINDPIAPEFITSQIG 
PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 
EAAPSHEVVVWQSVAKLVGPGPGPGATATVTVLVERVMPPPKLDQES YEASVP ISAPAGS FLLT 
IQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTLAPVPS 
QYLCTPRQDHGLIVSGPSKDPDIASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 
REH 1 1 PWVS HNAQMWQLL VRV I VCRCNVEGQCMRKVGRMKGMP TKL S AVG I L VG T L VAI G I FL I 
LI FTHWTMSRKKDPDQPADS VPLKATV 

Signal peptide: 

amino acids 1-18 

Transmembrane domain: 

amino acids 762-784 
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FIGURE QQ 



GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCCTG 
AGT CCAAGAT TCT T CCCAGGAACACAAAC GTAGGAGACCCACGC T C C TGGAAGCACCAGCC T TTA 
TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
ATG TAGGACCCAGGCAT C T TGC T TT CC AGCCACAAAGAGACAG ATGA AGAT GCAGAAAGGAAATG 
TTCTCCTTATGTTTGGTCTACTATTGCATTTAGAAGCTGCAACAAATTCCAATGAGACTAGCACC 
TCT GCCAACAC TGGATCCAGT GT GAT CT CCAG TGGAGCCAGCACAGCCACCAAC TCT GGG T C CAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 
GCAT AGT CACCAAC TCT GAGT T CCATACAACC T CCAGTGGGAT CAGCACAGCCACCAACTC TGAG 
TTCAGCACAGCGTCCAGTGGGATCAGCATAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGG 
GGCCAGCACAGCCACCAACTCTGAGTCCAGCACACCCTCCAGTGGGGCCAGCACAGTCACCAACT 
s , CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC 
JjjJ; AGTAGGGCCAGCACTGCCACCAACTCTGAGTCTAGCACACTCTCCAGTGGGG^^^ 
P CAACTCT GACTCCAGCACAACC TCCAGT GGGGC TAGCACAGCCACCAACT C TGAGTCCAGCACAA 

Q CCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACAGTGTCCAGTAGGGCCAGCACT 
m GCCACCAACT C TGAG TCCAGCACAAC CT CCAGTGGGGCCAGCACAGC CAC CAACTCT GAGTCCAG 

£1 AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA 
f-f GCACAGCCACCAACTCTGACTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAG 
N TCCAGCACGACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGG 
14 GGC TAGCACAGCCACCAAC TC T GAC T CCAGCACAACC TC CAGT GGGGCCGGCACAGCCACCAAC T 

CTGAGTCCAGCACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACACCCTCC 
AGT GGGGC CAACACAGC CACCAACT CTGAGT CCAG TACGACC T CCAGT GGGGCCAACACAGCCAC 
CAACTCTGAGTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAGTCCAGCACAA 
CCT C CAGT GGGGTCAGCACAGCCAC CAAC TCT GAG TCCAGCACAAC C T CCAGT GGGGC TAGCACA 
GCCACCAACT C TGAC TCCAGCACAACCTCCAG TGAGGCCAGCACAGCCACCAAC TCT GAGT C TAG 
CACAGT GT CCAGTGGGATCAGCACAG T CACCAATT C T GAG TCCAGCACAACC T CCAGTGGGGCCA 
ACACAGCCACCAACTCTGGGTCCAGTGTGACCTCTGCAGGCTCTGGAACAGCAGCTCTGACTGGA 
ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT 

GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 
TCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACACCTTTAACACAGCTGTCTACCACCCT 
CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
GT GGAGT CCTAAC TGGT TC TGGAGGAGACCAGTAT CATC GAT AGCCAT GGAGAT GAGCGGGAGGA 
ACAGCGGGCCCTGAGCAGCCCCGGAAGCAAGTGCCGCATTCTTCAGGAAGGAAGAGACCTGGGCA 
CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 
AATCTTGAAGAAGGTATTCCTCACCTTTCTTGCCTTTACCAGACACTGGAAAGAGAATACTATAT 
T GC T CAT T T AGC TAAGAAATAAATACAT CT CATCTAACACACAC GACAAAGAGAAGCT GT GC T T G 
CCCCGGGGTGGGTATCTAGCTCTGAGATGAACTCAGTTATAGGAGAAAACCTCCATGCTGGACTC 
CATCTGGCATTCAftAATCTCCACAGTAAAATCCAAkGACCT^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



m 
m 



:; s 
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FIGURE 100 



MKMQKGNVLLMFGLLLHLEAATNSNETSTSANTGSSVISSGASTATNSGSSVTSSGVSTATISGS 
SVTSNGVSIVTNSEFHTT.SSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSESSTPSS 
GASTVTNSGSSVTSSGASTATNSESSTVSSRASTATNSESSTLSSGASTATNSDSSTTSSGASTA 
TNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSES 
STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS 
GAGT ATNS ES S TVS SG I S T VTNSES S T PS S GANTATNSE S S T T S S GANTATNS ESS TVS S GAS TA 
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEASTATNSESSTVSSGISTVTNSES 
STTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSASTAVSEAKPGGSLVPWEIFLITLVSWA 



AVGLFAGLFFCVRNSLSLRNTFNTAVYHPHGLNHGLGPGPGGNHGAPHRPRWSPNWFWRRPVSSI 




Signal peptide: 



AMEMSGRNSGP 



amino acids 1-20 




Transmembrane domain: 



amino acids 510-532 
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FIGURE 101 



GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGCT GGAAC TAT CAGGC TGAAAAACAGAGTGGG TACT C TCT TCT GGGAAGCT GGCAACAAAT 
GGATGATGTGATATATGCATTCCAGGGGAAGGGAAATTGTGGTGCTTCTGAACCCATGGTCAATT 
AACGAGGCAGT T TC TAGC TACT GCACGT ACT TCATAAAGCAGGACT CTAAAAGC T TT GGAATCAT 
GGTGTCATGGAAAGGGATTTACTTTATACTGACTCTGTTTTGGGGAAGCTTTTTTGGAAGCATTT 
TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCATCTTGGTATCGCTGGATCAACAAC 
k-k C GCC T T GT GGCAACAT GGC TC AC CC T ACC T GT GGCAT TAT TGG AGAC CAT GT T T GGT GTAAAAGT 

C! GAT T AT AAC T GGGGATGCAT T T G T T CC T GGAGAAAGAAG TG TC AT TAT CAT GAACCATC GGACAA 

GAATGGACTGGATGTTCCTGTGGAATTGCCTGATGCGATATAGCTACCTCAGATTGGAGAAAATT 
TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 
CTTCAT TCATAGGAAATGGAAGGATGACAAGAGCCAT TTCGAAGACATGAT TGAT TACTTTTGTG 
ATAT TCACGAACCACTTCAACT CCTCATATTCCCAGAAGGGACTGAT CTCACAGAAAACAGCAAG 
T CT CGAAGTAATGCAT T TGCTGAAAAAAATGGACT TCAGAAAT AT GAATAT GT TT TACATCCAAG 
^ AACT ACAGGC TT TAC T T T T GT GG TAGACCG TC TAAGAGAAGGTAAGAACC T TGAT GC TGTCCATG 

pi ATATCACTGTGGCGTATCCTCACAACATTCCTCAATCAGAGAAGCACCTCCTCCAAGGAGACTTT 
PI C C C AG GG AAAT C CAC T T T C AC G T C C AC C G G TAT C C AAT AG ACAC C C T C C C C AC AT C C AAGG AGG A 

8J CCTTCAACTCTGGTGCCACAAACGGTGGGAAGAGAAAGAAGAGAGGCTGCGTTCCTTCTATCAAG 
Ml GGGAGAAGAATT TT TATTT TACCGGACAGAGTGTCAT TCCACCT TGCAAGT CTGAACTCAGGGTC 

^ CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 

ATATTTGTACAGTCTTGTTAAGTGGTATTTTATAATCACCATTGTAATCTTTGTGCTGCAAGAGA 
GAATATTTGGTGGACTGGAGATCATAGAACTTGCATGTTACCGACTTTTACACAAACAGCCACAT 
T T AAAT T CAAAGAAAAAT GAG TAAG AT T ATAAGG T T T GC CATG T GAAAAC C T AGAGCATAT T T T G 
GAAATGTTCTAAACCTTTCTAAGCTCAGATGCATTTTTGCATGACTATGTCGAATATTTCTTACT 
GCCAT CAT TAT T TGT TAAAGAT AT T T T GC ACT T AAT T T T GTGGGAAAAATAT T GCTACAAT T T T T 
TTTAATCTCTGAATGTAATTTCGATACTGTGTACATAGCAGGGAGTGATCGGGGTGAAATAACTT 
GGGCCAGAATAT XATTAAACAATCATCAGGCTT T TAAA 
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in 
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mi 



MHSRGREIWLLNPWSINEAVSSYCTYFIKQDSKSFGIMVSWKGIYFILTLFWGSFFGSIFMLSP 
FLPLMFVNPSWYRWINNRLVATWLTLPVALLETMFGVKVI ITGDAFVPGERSVI IMNHRTRMDWM 
FL WNCLMRY S YLRLEK I C LKAS LKG VPG FG WAMQAAAY I F I HRKWKDDKS H FEDMI DY FCD I HE P 
LQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFWDRLREGKNLDAVHDITVA 
YPHNIPQSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDLQLWCHKRWEEKEERLRSFYQGEKNF 
YFTGQSVI PPCKSELRVLWKLLS I LYWTLFS PAMCLLI YLYS LVKWYFI I T I VI FVLQERI FGG 
LEI IELACYRLLHKQPHLNSKKNE 



I?! Important features of the protein: 

M Signal peptide: 

amino acids 1-22 

jUJ Transmembrane domains: 

amino acids 44-63, 90-108, 354-377 
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CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 
TCCAAATCATCCATCCACCCCTGCTGTCATCTGTTTTCATAGTGTGAGATCAACCCACAGGAATA 
TCCATGGCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
T T C C T GAGAC C AGT GCAGAGGC TAT G GAAG TGCGGTTCTT C AGGAAT CAG T T C CAT GC T G T GGT C 
CACCTCTACAGAGATGGGGAAGACT GGGAAT CTAAGCAGATGCCACAGTAT CGAGGGAGAAC TGA 
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 
ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCAGAAGGAC 
AG GAT T T G T C T T CAG AC T C CAG AG C AAAT G G AGAT G GG T ACAG C C T G T AT GAT G T G GAG AT C T C C 
AT T AT AGT CCAGGAAAAT GCT GGGAGCATAT T G TGT TCCATCCACC T T GC T GAGCAGAGTCATGA 
GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 
m TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 

AAAT CCAAAGGGAAAATCCAGGCGGAAC T GGAC TGGAGAAGAAAGCACGGACAGGCAGAAT TGAG 
fl AGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCG 
W TTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTGCCTCACTCTGAGAAGAGA 
SI TTTACAAGGAAGAGTGTGGTGGCTTCTCAGGGTTTCCAAGCAGGGAGACATTACTGGGAGGTGGA 
y CGTGGGACAAAATGTAGGGTGGTATGTGGGAGTGTGTCGGGATGACGTAGACAGGGGGAAGAACA 
r! ATGT GAC T T T GTCTCC CAACAAT GGGTAT T GGGTCC T CAGAC T GACAACAGAACATT T GTAT T T C 

■ ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 
- GGACTATGAGGGTGGGACCATCTCCTTCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGC 
y T GACATGTCAGT TTGAAGGCTTGTTGAGACCC TATATCCAGCATGCGATGTATGACGAGGAAAAG 

HI GGGACTCCCATATTCATATGTCCAGTGTCCTGGGG ATGA GACAGAGAAGACCCTGCTTAAAGGGC 
?*l CCCACACCACAGACCCAGACACAGCCAAGGGAGAGTGCTCCCGACAGGTGGCCCCAGCTTCCTCT 
JjJ, CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 
5f TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG 
f!J T CAGAAGCCATGGCT GCCC TGAAGT GGGGACGGAATAGAC TCACAT T AGGT T TAGT T TG TGAAAA 

f|| CTCCATCCAGCTAAGCGATCTTGAACAAGTCACAACCTCCCAGGCTCCTCATTTGCTAGTCACGG 

ACAGT GAT TCC T GCC TCAC AGG T GAAGAT TAAAGAGACAAC GAAT GT GAATCAT GCT T GCAGGT T 
T GAGGGCACAG T GT T TGC T AAT GAT G T GT T T T TAT AT TATACAT T T T CCCAC CATAAAC TC T G T T 
T GCT TAT T CCACAT TAAT T TAC T T T T CTCTATACCAAAT CACC CAT GGAATAGT TAT T GAACACC 
T GC T T TGT GAGGC TCAAAGAATAAAGAGGAGG TAGGAT T T T TCACT GATT C TATAAGCCCAGCAT 
TACCTGATACCAAAACCAGGCAAAGAAAACAGAAGAAGAGGAAGGAAAACTACAGGTCCATATCC 
CTCAT TAACACAGACACAAAAATTCTAAATAAAATT TTAACAAATTAAACTAAACAATATAT T TA 
AAGAT GATATATAACTAC T CAGT GT GGT T T GT CCCACAAAT GCAGAG T TGGT T TAATAT TTAAAT 
ATCAACCAGTGTAAT T CAGCACAT TAATAAAG TAAAAAAGAAAACCAT AAAAAAAAAAAAAAA 
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MAFVLI LVLS FYELVSGQWQVTGPGKFVQALVGEDAVFS C SL FPET SAEAMEVRFFRNQFHAWH 
LYRDGEDWESKQMPQYRGRTEFVKDSI'AGGRVSLRLKNITPSDIGLYGCWFSSQIYDEEATWELR 
VAALGS LPL I S I VGWDGGIQLLCLSSGW FPQPTAKWKGPQGQDLS SDSRANADGYSLYDVE I S I 
I VQENAGS I LCS IHLAEQSHEVESKVLI GET FFQPS PWRLAS I LLGLLCGALCGWMGMI I VFFK 
SKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQGFQAGRHYWEVDVGQNVGWYVGVCRDDVDRGKNNVTLSPNNGYWVLRLTTEHLYFT 
FNPHFISLPPSTPPTRVGVFLDYEGGTISFFNTNDQSLIYTLLTCQFEGLLRPYIQHAMYDEEKG 
TPIFICPVSWG 




Signal peptide: 



amino acids 1-17 



Transmembrane domains: 



amino acids 131-150, 235-259 
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CC TTCAC AGG AC TCTT C ATT GCTGGT TGGC A ATGA T GT AT CGGC CAGAT GTGGT GAGGGC T AGGAAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CT CACT GT T C AT T AT GTGAGAT AT AATC AAAAGAAGAC CT AC AAT T ACT AT AGC AC AT TGTC AT TT AC AAC 
TGACAAACTATATGCTGAGTTTGGCAGAGAGGCTTCTAACAATTTTACAGAAATGAGCCAGAGACTTGAAT 
CAATGGTGAAAAATGCATTTTATAAATCTCCATTAAGGGAAGAATTTGTCAAGTCTCAGGTTATCAAGTTC 
AGTC AAC AGAAGC AT GGAGT GT TGGC TCAT AT GCTGTT GATT TGT AGATT TC AC TCT ACT GAGG ATCC TG A 
AACTGTAGATAAAATTGTTCAACTTGTTTTACATGAAAAGCTGCAAGATGCTGTAGGACCCCCTAAAGTAG 
AT CCTC AC TC AGTT AAAATT AAAAAAATC AAC AAGACAGAAAC AGAC AGC TAT C TAAACC AT TGCTGC GG A 
AC ACGAAGAAGT AAAACT CT AGGT C AGAGT CT C AGGAT CGTT GG TGGG AC AG AAGT AGAAGAGGGT G AAT G 
GCCCTGGCAGGCTAGCCTGCAGTGGGATGGGAGTCATCGCTGTGGAGCAACCTTAATTAATGCCACATGGC 
TTGT GAGT GC TGCT CACT GT TT TACAAC AT AT AAGAACCCTGCC AG AT GGACTGCT T CCT TT GGAGT AAC A 
AT AAAACCT TC G AAAATGAAACGGGGTCT C CGGAGAAT AATT GT C CAT GAAAAAT ACAAAC AC CC AT C AC A 
TGACTATGATATTTCTCTTGCAGAGCTTTCTAGCCCTGTTCCCTACACAAATGCAGTACATAGAGTTTGTC 
TCCCTGATGCATCCTATGAGTTTCAACCAGGTGATGTGATGTTTGTGACAGGATTTGGAGCACTGAAAAAT 
||j GAT GGT TAC AGT C AAAAT C ATCTT CG ACAAGC ACAGGT GACT CT C AT AGACGCT AC AACT TGC AAT G AAC C 

% TCAAGCTTACAATGACGCCATAACTCCTAGAATGTTATGTGCTGGCTCCTTAGAAGGAAAAACAGATGCAT 

GC C AGG GT GACT C TGG AGG ACC ACT GGT T AGT T C AGAT GC TAGAGAT AT C TGGT ACCT T GCTGG AAT AGT G 
M AGCT GGGGAGAT GAAT GT GCGAAACC CAAC AAGCCT GGT GTT TAT ACT AG AGTT ACGGC C TT GCGGGACT G 

GATT AC TT CAAAAACT GGT ATC TAAG AG AC AAAAGC CTC ATG GAACAG AT AAC ATT TTTTTTTGTTTTTTG 
GGTGT GGAGGCC AT T T TT AGAGAT AC AG AATT GG AGAAGAC T TGCAAAAC AGCT AGAT TT GACT GATC TC A 
ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 
GATC AACTCTGT CAT CT G TG AGC AAT AG TT GAAACT TT ATGT AC AT AG AGAAAT AGAT AAT ACAAT AT TAC 
ATTACAGCCTGTATTCATTTGTTCTCTAGAAGTTTTGTCAGAATTTTGACTTGTTGACATAAATTTGTAAT 
GC AT AT AT AC AATT TGAAGC ACTCCT T T TC TT C AGT TCCT C AGC TCCTCT CAT T TC AGCAAAT AT CCATT T 
T CAAGGTGC AGAAC AAGG AG TGAAAG AAAATAT AAGAAGAAAAAAATC CC CT AC AT TTT ATT GGC AC AGAA 
AAGTATTAGGTGTTTTTCTTAGTGGAATATTAGAAATGATCATATTCATTATGAAAGGTCAAGCAAAGACA 
GCAGAATACCAATCACTTCATCATTTAGGAAGTATGGGAACTAAGTTAAGGAAGTCCAGAAAGAAGCCAAG 
AT AT AT CCT TAT TT TC AT T T CC AAAC AACT ACT AT G AT AAAT GT GAAGAAGAT T CT GTT T TT T TGT GACC T 
AT AAT AATT ATACAAACT TCAT GCAATGT ACT TGT T CT AAGC AAAT T AAAGC AAAT AT T TAT T T AAC ATT G 
T TACTG AGG ATGTC AAC AT ATAACAAT AAAAT AT AAAT CACCCA 
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MMYRPDWRARKRVCWE PW VI GLV I FI SL I VLAVC I GLT VHYVRYNQKKT YNYYS TL S FTTDKL Y 
AE FGREASNNFTEMSQRLE SMVKNAFYKS PLREE FVKS QVIKFS QQKHGVLAHMLL I CRFHS TED 
PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TEVEEGEWPWQAS LQWDGS HRCGATL INATWLVS AAHC FTT YKNPARWTAS FGVT I KPSKMKRGL 
RRIIVHEKYKHPSHDYDISLAELSSPVPYTNAVHRVCLPDASYEFQPGDVMFVTGFGALKNDGYS 
QNHLRQAQVTLIDATTCNEPQAYNDAITPRMLCAGSLEGKTDACQGDSGGPLVSSDARDIWYLAG 
IVSWGDECAKPNKPGVYTRVTALRDWITSKTGI 

Transmembrane domain: 

amino acids 21-40 (type II) 
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FIGURE 107 



AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
ATAAT GT TC TCACT T TAACTGCAGAAAC GAGGGTAGAGGAAGCAGT CAT T T T GAC TTAC T T T CCT 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 
GTGTAGAACTGGCTTGTGGCGTTTGGACATATGAACAGGAACTTATGGTTCCAGTACAATGGTCA 
GAT ATGGT CACT T T GAAAGCCAGGAT GACAAAT TATGGAT TACCT AGAT AT CGG TGGC T TAC TCA 
TGCTTGGAATTTTTTTCAGAGAGAGTTTAAGTGCTGTGGAGTAGTATATTTCACTGACTGGTTGG 
H AAAT GACAGAGATGGAC T GGC CC CCAGAT TCCT GC T GTGT TAGAGAAT TCCCAGGATGT TCCAAA 

Q CAGGCCCACCAGGAAGATC TCAGTGACCT TTATCAAGAGGGTT GTGGGAAGAAAATGTATTCCTT 

TTTGAGAGGAACCAAACAACTGCAGGTGCTGAGGTTTCTGGGAATCTCCATTGGGGTGACACAAA 
TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 
ACAGACCAAATGAT G TCCT TGAAGAAT GACAACTCTCAGCACC TGT CATGTCCC TCAGTAGAAC T 
M GT T GAAACCAAGCC TGTCAAGAATCT TT GAACACACATCCATGGCAAACAGCT T TAATACACAC T 

T TGAGAT GGAGGAG T TATAAAAAGAAAT GT CACAGAAGAAAAC CACAAAC T T GT T T TATT GGAC T 
y | TGT GAAT T T TTGAGTACATACT AT GT GT T T CAGAAATAT GTAGAAATAAAAATGT TGCCATAAAA 

f$) TAACACCTAAGCATATACTATTCTATGCTT TAAAATGAGGATGGAAAAGT TTCATGTCATAAGTC 

1 ^ ACCACCT GGACAATAATT GATGCCC T TAAAATGC T GAAGACAGATGTCAT ACCCACTG TGTAGCC 

*f TGTGTATGACTTTTACTGAACACAGTTATGTTTTGAGGCAGCATGGTTTGATTAGCATTTCCGCA 
C) TCCATGCAAACGAGTCACATATGGTGGGACTGGAGCCATAGTAAAGGTTGATTTACTTCTACCAA 
||! C TAGTAT ATAAAGTAC TAAT TAAATGCTAACATAGGAAGT TAGAAAAT ACT AATAAC TT TTAT TA 

C T C AG C GAT C TAT T C T T C T GAT GC T AAAT AAAT TAT AT AT CAG AAAAC T T T C AAT AT T GG T GAC T 
gjj ACCTAAAT GT GATT T T TGC TG GT T ACT AAAATAT TC T TACCAC T TAAAAGAGCAAGC TAACACAT 

W|f T GT C T T AAGC T GATCAGGGAT T T TT T G TATATAAG T C TGTGT TAAAT CT GTATAATT CAGT CGAT 

Cl T TCAGT T C T GAT AATGT TAAGAAT AACCAT TATGAAAAGGAAAATTT GT CC TGTATAGCAT CATT 

AT T T T T AGCC T T TC C TGT TAAT AAAGCT T TAC TAT TC TG TCCT GGGC T TAT AT TACACATATAAC 
TGT TAT T TAAAT AC T T AAC CAC T AAT T TT GAAAAT TAC CAG TGT GAT AC AT AGGAAT CAT TAT T C 
AGAAT GTAGT C T GGTC T T TAG GAAG TATT AAT AAGAAAAT T TGCACATAAC TTAGT T GAT T CAGA 
AAGGACT T G TAT GC TGT T T TTCT CC CAAAT GAAGACTCT T T TT GACACTAAACAOT T T T TAAAAA 
GCTT ATC T TT GCC T T CTCCAAACAAGAAGCAATAGT C TCCAAGT CAATATAAAT TC TACAGAAAA 
T AGT GTTCTTTT TC TCCAGAAAAAT GC T T GTGAGAAT CAT TAAAACAT GTGACAAT T TAGAGAT T 
CTTTGTTT TAT T T CACT GAT TAAT AT AC TGT GGCAAAT TAC AC AG AT TAT TAAAT T T T T T TAC AA 
GAG TATAGTATAT T TAT T T GAAATGGGAAAAGT GCAT TT TACT GT AT T T TGTGTAT T T TGT T TAT 
TTCT CAGAAT AT GGAAAGAAAAT TAAAAT GT GTCAATAAAT AT TT TCTAGAGAGTAA 
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MAREDSVKCLRCLLYALNLLFWLMS I SVLAVSAWMRDYLNNVLTLTAETRVEEAVILTYFPWHP 
VMIAVCCFLI IVGMLGYCGTVKRNLLLLAWYFGSLLVI FCVELACGVWTYEQELMVPVQWSDMVT 
LKARMTNYGLPRYRWLTHATOFFQREFKCCGWYFTDWLEMTEMDWPPDSCCVREFPGCSKQAHQ 
EDLSDLYQEGCGKKMYS FLRGTKQLQVLRFLGI S IGVTQILAMILTITLLWALYYDRREPGTDQM 
MSLKNDNSQHLSCPSVELLKPSLSRIFEHTSMANSFNTHFEMEEL 

Signal peptide: 

amino acids 1-33 




Transmembrane domains: 



amino acids 12-35, 57-86, 94-114, 226-248 
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CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 
GATAAAGTCTTTCTCGTGAGAGCCTAGAGGCCTTAAAAAAAAAAGTGCTTGAAAGAGAAGGGGACAAAGGAACA 
CCAGTATTAAGAGGATTTTCCAGTGTTTCTGGCAGTTGGTCCAGAAGGATGCCTCCAYTCCTGCTTCTCACCTG 
CCTCTTCATGACAGGCACCTCCGTGTCACCCGTGGCCCTAGATCCTTGTTCTGCTTACATCAGCCTGAATGAGC 
CCTGGAGGAACACTGACCACCAGTTGGATGAGTCTCAAGGTCCTCCTCTATGTGACAACCATGTGAATGGGGAG 
T GGT ACC ACTT CAC GGGC ATGGCGGGAGAT GCCAT GCCT ACCT TCT GCATACC AGAAAAC CACT GTGGAACCCA 
CGCACCTGTCTGGCTCAATGGCAGCCACCCCCTAGAAGGCGACGGCATTGTGCAACGCCAGGCTTGTGCCAGCT 
TCAATGGGAACTGCTGTCTCTGGAACACCACGGTGGAAGTCAAGGCTTGCCCTGGAGGCTACTATGTGTATCGT 
CTGACCAAGCCCAGCGTCTGCTTCCACGTCTACTGTGGTCATTTTTATGACATCTGCGACGAGGACTGCCATGG 
CAGCTGC TCAGAT ACCAGC GAGT GCAC AT GC GCTCC AGGAACT GT GC TAGGCCCT GACAGGC AGACATGCT T TG 
ATGAAAAT GAAT GT GAGCAAAAC AACGGTGGCT GC AGT GAGATCTGT GT GAACCTC AAAAACTCCTACCGCT GT 
GAGT GT G GG GT T GGCC GT GT GCT AAGAAGT GAT GGC AAGAC TT GT GAAGAC GT T GAAGGAT GCC AC AAT AAC AA 
TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGT GTGAATGTCCCCGGGGCCTGGTGCTGT 
CT GAGGAT AACCACACTTGCCAAGT CCCTGT GTTGT GCAAATC AAAT GCCATTGAAGT GAACAT CCCC AGGGAG 
CTGGTTGGTGGCCTGGAGCTCTTCCTGACCAACACCTCCTGCCGAGGAGTGTCCAACGGCACCCATGTCAACAT 
CCT CT T CT CTC TC AAGAC AT GT GGT AC AGT G GT C GAT GT G GT GAAT G ACAAGAT T GT GGC C AGC AACC TC GT GA 
CAGGTCTACCCAAGCAGACCCCGGGGAGCAGCGGGGACTTCATCATCCGAACCAGCAAGCTGCTGATCCCGGTG 
ACCTGCGAGTTTCCACGCCTGTACACCATTTCT GAAGGAT ACGTTCCCAACCTTCGAAACTCCCCACTGGAAAT 
CAT GAGC C GAAAT CAT G GGAT CT T C CC ATT C ACTC T GGAGATC T TC AAGGAC AAT GAGT T T GAAGAGC CT T ACC 
GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 
TTGGAAAGCTTGGTGGAGAGCTGCTTTGCCACCCCCACCTCCAAGATCGACGAGGTCCTGAAATACTACCTCAT 
CCGGGATGGCTGTGTTTCAGATGACTCGGTAAAGCAGTACACATCCCGGGATCACCTAGCAAAGCACTTCCAGG 
TCCCTGTCTTCAAGTTTGTGGGCAAAGACCACAAGGAAGTGTTTCTGCACTGCCGGGTTCTTGTCTGT GGAGTG 
TTGGACGAGCGTTCCCGCTGTGCCCAGGGTTGCCACCGGCGAATGCGTCGTGGGGCAGGAGGAGAGGACTCAGC 
CGGTCTACAGGGCCAGACGCTAACAGGCGGCCCGATCCGCATCGACTGGGAGGACTAGTTCGTAGCCATACCTC 
GAGTCCCTGCATTGGACGGCTCTGCTCTTTG'GAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
TGGGTTCAGACTTCACACTGTGAGTTCAGACTCCCAGCACCAACTCACTCTGATTCTGGTCCATTCAGTGGGCA 
CAGGTCACAGCACTGCTGAACAATGTGGCCTGGGTGGGGTTTCATCTTTCTAGGGTTGAAAACTAAACTGTCCA 
CCCAGAAAGACACTCACCCCATTTCCCTCATTTCTTTCCTACACTTAAATACCTCGTGTATGGTGCAATCAGAC 
CACAAAAT CAGAAGCTGGGT AT AATAT TTCAAGTT ACAAACCCT AGAAAAAT T AAACAGT TACT GAAATTATGA 
CTTAAATACCCAATGACTCCTTAAATATGTAAATTATAGTTATACCTTGAAATTTCAATTCAAATGCAGACTAA 
TTATAGGGAAT TT GG AAGT GT AT C AAT AAAAC AGT AT AT AAT T T T 



FIGURE no 

MPPFLLLTCLFITGTSVSPVALDPCSAYISLNEPWRNTDHQLDESQGPPLCDNHVNGEWYHFTGMAGDAMP 
T FC I PENHCGTHAPWLNGSHPLEGDGI VQRQACAS FNGNCCLWNTTVEVKACPGGY Y VYRLTKPS VC FH V 
YCGHFYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTCFDENECEQNNGGCSEICVNLECNSYRCECGVGRV 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEVNIPRELVGG 
LELFLTNTSCRGVSNGTHVNILFSLKTCGTWDWNDKIVASNLVTGLPKQTPGSSGDFIIRTSKLLIPVT 
CE FPRLYT ISEGYVPNLRNSPLEIMSRNHGI FPFTLEI FKDNE FEEPYREALPTLKLRDSLY FGIEPWHV 
SGLESLVESCFATPTSKIDEVLKYYLIRDGCVSDDSVKQYTSRDHLAKHFQVPVFKFVGKDHKEVFLHCRV 
LVCGVLDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPIRIDWED 

Important features of the protein: 
Signal peptide: 

amino acids 1-16 

N-glycosylation sites. 

amino acids 89-93, 116-120, 259-263, 291-295, 299-303 

Tyrosine kinase phosphorylation sites. 

amino acids 411-418, 443-451 

N-myristoylation sites. 

amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 
522-528, 531-537 

Aspartic acid and asparagine hydroxylation site. 

amino acids 197-209 

ZP domain proteins. 

amino acids 431-457 

Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
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GAGAG AGG C AGC AGCT TGCT CAGCGG AC AAGGAT GC T G GGCGT G AGGGAC C AAG GCCT GCCCTGC AC T CGG 
GCCTCCTCCAGCCAGTGCTGACCAGGGACTTCTGACCTGCTGGCCAGCCAGGACCTGTGTGGGGAGGCCCT 
CCTGCT GCCTTGGGGTGAC AAT CT CAGCT C C AGGCT AC AGGGAGACCGGG AGGATC AC AGAGCC AGC ATGT 
TACAGGAT CCTG AC AGT GAT CAACCT CT GAAC AGCC T C G ATGT C AAACCC CT GCGC AAAC CC CGT AT C CC C 
ATGGAGACCTTCAGAAAGGTGGGGATCCCCATCATCATAGCACTACTGAGCCTGGCGAGTATCATCATTGT 
GGTTGTCCTCATCAAGGTGATTCTGGATAAATACTACTTCCTCTGCGGGCAGCCTCTCCACTTCATCCCGA 
GG AAGCAGCT GT GT GACGGAGAGCTGGACT GT CCCTTGGG GGAGGACGAGGAGC AC T GTGTC AAGAGC TT C 
CCCGAAGGGCCTGCAGTGGCAGTCCGCCTCTCCAAGGACCGATCCACACTGCAGGTGCTGGACTCGGCCAC 
H AGGGAACTGGTTCTCTGCCTGTTTCGACAACTTCACAGAAGCTCTCGCTGAGACAGCCTGTAGGCAGATGG 
4»<? GC T ACAGCAGAGCT GTGGAGATT GGC CCAGAC C AGG AT CT GGAT GT T G TT GAAATC AC AG AAAACAGCCAG 

W GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 
f I TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 

C * ' 

r? TC AGC ATC C AGT ACG ACAAAC AGC AC GTCT GT GGAGGGAGC ATC CT GG AC CCCC ACT GGGTCCT CACGGC A 



ill 



GCCCACTGCTTCAGGAAACATACCGATGTGTTCAACTGGAAGGTGCGGGCAGGCTCAGACAAACTGGGCAG 



R | CTTCCCATCCCTGGCTGTGGCCAAGATCATCATCATTGAATTCAACCCCATGTACCCCAAAGACAATGACA 
s . . TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT 

GATGAGGAGC TC ACTCCAGC CACCCC ACT C TGGAT C AT T GGAT G GGGCT T T ACGAAGC AGAAT GGAGGGAA 



PI GATGTCTG AC AT ACT GCT GC AGGCGT C AGT CC AGGT CAT T GAC AGC AC AC GGT GCAAT GC AGACGATGCGT 

fcl AC C AGGGGG AAGTCACCG AG AAGATGAT GT GT GCAGGC ATCCCGGAAGGGGG TGTGGAC ACC TGCC AGGG T 

Afti 

GACAGTGGTGGGCCCCTGATGTACCAATCTGACCAGTGGCATGTGGTGGGCATCGTTAGCTGGGGCTATGG 

0 



CTGCGGGGGCCCGAGCACCCCAGGAGTATACACCAAGGTCTCAGCCTATCTCAACTGGATCTACAATGTCT 
GGAAGGCTGAGCTG TAA TGCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGGATCCCCCAAAGTCAGACACAGAGCAAGAGTCCCCTTGGGTACACCCCTCTGCCCACAGCCTCAGCAT 
TTCTTGGAGCAGCAAAGGGCCTCAATTCCTGTAAGAGACCCTCGCAGCCCAGAGGCGCCCAGAGGAAGTCA 
GCAGCCCTAGCTCGGCCACACTTGGTGCTCCCAGCATCCCAGGGAGAGACACAGCCCACTGAACAAGGTCT 
CAGGGGTATT GCT AAGCC AAGAAGGAACT T TCCCAC ACT ACT GAATGGAAGC AGGC T GTCTT GT AAAAGCC 
CAGATCACTGTGGGCTGGAGAGGAGAAGGAAAGGGTCTGCGCCAGCCCTGTCCGTCTTCACCCATCCCCAA 
GCCTACTAGAGCAAGAAACCAGTTGTAATATAAAATGCACTGCCCTACTGTTGGTATGACTACCGTTACCT 
ACTGTTGTCATTGTTATTACAGCTATGGCCACTATTATTAAAGAGCTGTGTAACATCTCTGGCAAAAAAAA 
AAAA 
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FIGURE 112 



MLQDPDSDQPLNSLDVKPLRKPRIPMETFRKVGIPIIIALLSLASIIIVWLIKVILDKYYFLCG 
QPLHFIPRKQLCDGELDCPLGEDEEHCVKS FPEGPAVAVRLSKDRSTLQVLDSATGNWFSACFDN 
FTEALAETACRQMGYSRAVEIGPDQDLDWEITENSQELRMRNSSGPCLSGSLVSLHCLACGKSL 
KTPRWGGEEASVDSWPWQVSIQYDKQHYCGGSILDPHWLTAAHCFRKHTDVFNWKVRAGSDKL 
GSFPSLAVAKIIIIEFNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWIIGWG 
FTKQNGGKMSDILLQASVQVIDSTRCNADDAYQGEVTEKMMCAGIPEGGVDTCQGDSGGPLMYQS 
DQWHWGIVSWGYGCGGPSTPGVYTKVSAYLNWIYNVWKAEL 

Transmembrane domain: 

amino acids' 32-53 (typell) 



m 

I, 
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FIGURE us 



GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 
TAGGTGTAAGCCACCGTGTCTGGCCTCTGAACAACTTTTTCAGCAACTAAAAAAGCCACAGGAGT 
TGAACTGCTAGGATTCTGACT ATGC TGTGGTGGCTAGTGCTCCTACTCCTACCTACATTAAAATC 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 
GGCCCAAACCTGACCTTCACTCTGGAACGAGAACAGAGGTTTCTACCCACACCGTCCCCTCGAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
CTTGCCCTGGCCGTAGAAGGGATTGACAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 
AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 
GAAGCAGTGGGTGAGACATCACGCTGCCCGCCCATCTAACCTTTTCATGTCCTGCACATCACCTG 
g,j ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 

%l CAGAAGGGGT CT GC T TAGACCACC T G GT T T ATG T GACAGGACT T GCAT TC T CC T GGAACATGAGG 

■? 

M| GAACGCCGGAGGAAAGCAAAGTGGGAGGGT^AGGAACTTGTGCCAAATTATGGGTCAGAAAAGATG 
Rl GAGGTGTTGGGTTATCACAAGGCATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTG 

CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 
CCACGTACCAGCTGCTGAAGGGCAACTGCAGGCCGATGCTCTCATCAGCCAGGCAGCAGCCAAAA 
TCTGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCC 



CI 

0 



Cfl CT CC TT CCCTC TGAGAGGC CC T CCT AT GT CCC TACTAAAGCCACCAGCAAGACATAGC TGACAGG 

M GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 

FII 

GTGCGAACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGG 
CT CAATT TAAATCATGT TC TAGTAATTGGAGC TGTCCCCAAGACCAAAGGAGCTAGAGCT TGGT T 
CAAATGATCTCCAAGGGCCCTTATACCCCAGGAGACTTTGATTTGAATTTGAAACCCCAAATCCA 
AACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 
TTTGGGAGGCCGAGGCGGGTAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 
TGAAACCCCTGTCTCTACTAAAAATACAAAATWVCTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
CCAGCTACTCGGGAGGCTGAGACAGGAGAATTACTTGAACCTGGGAGGTGAAGGAGGCTGAGACA 
GGAGAAT CACT TCAGCCTGAGCAACACAGCGAGACTCTGTCTCAGAAAAAATAAAAAAAGAAT T A 
TGGTTATTTGTAA 
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FIGURE 114 



MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPLAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 

Signal peptide: 

amino acids 1-15 



hi* 



as 

y 

I gig 
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FIGURE lis 



CAGCAGT GGT C TCT CAGT CC TC TCAAAGCAAGGAAAGAG TACTGT GT GC T GAGAGACCATGGCAA 
AGAATCCTCCAGAGAAT TGTGAAGACTGTCACATTC TAAATGCAGAAGCT T TTAAATCCAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
GTTTTGGGGGAGCAAGCACTTCTGGCCGGAGGTACCCAAAAAAGCCTATGACATGGAGCACACTT 
T C T ACAGCAATGGAGAGAAGAAGAAGAT T TACAT GGAAAT T GAT CC T GT GACC AGAAC TGAAAT A 
T T CAGAAGCGGAAAT GGCAC T GAT GAAAC AT T GGAAG T GCACGACT T TAAAAACGGAT ACAC T GG 
CATCTACTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGATTAAAGTGATTCCTGAATTTT 
iy C T GAAC C AGAAG AG GAAAT AG AT G AG AAT G AAG AAAT T AC C AC AAC T T T C T T T GAAC AG T C AG T G 

ATTTGGGTCCCAGCAGAAAAGCCTATTGAAAACCGAGATTTTCTTAAAAATTCCAAAATTCTGGA 
GAT T TGTGATAAC GT GACCAT GT AT T GGAT CAATC CCAC TC TAAT AT CAG T T T C TGAGT TACAAG 
ACT T TGAGGAGGAGGGAGAAGATCT T CAC TT T C CT GCCAACGAAAAAAAAGGGAT TGAACAAAAT 



\| GAACAGTGGGTGGTCCCTCAAGTGAAAGTAGAGAAGACCCGTCACGCCAGACAAGCAAGTGAGGA 
y| AGAACT T CCAATAAATGACTATAC T GAAAAT GGAATAGAAT TT GAT CC CAT GC T GGATGAGAGAG 

III GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 



GGCTACTACCCATATCCATACTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTG 
TAACTGGTGGGTGGCCCGCATGCTGGGGAGGGTCTAATAGGAGGTTTGAGCTCAAATGCTTAAAC 



m TGCTGGCAACATATAATAAATGCATGCTATTCAATGAATTTCTGCCTATGAGGCATCTGGCCCCT 
Ill GGTAGCCAGCTCTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACA 



Wf T TAT CACCAAAAAAAAAAAAAAAAAAA 

fll 
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FIGURE 116 



MAKNPPENCEDCHILNAEAFKSKKICKSLKICGLVFGILALTLIVLFWGSKHFWPEVPKKAYDME 
HTFYSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQIKVIP 
EFSEPEEEIDENEEITTTFFEQSVIWVPAEKPIENRDFLKNSKILEICDNVTMYWINPTLISVSE 
LQDFEEEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD 
ERGYCC I YCRRGNRYCRRVCE PLLGYYP YP YC YQGGRVICRVIMPCNWW VARMLGRV 

Important features of the protein: 
Signal peptide: 

amino acids 1-40 

Transmembrane domain: 

m 

%A amino acids 25-47 (type II) 

N-glycosylation sites. 

amino acids 94-97, 180-183 

Glycosaminoglycan attachment sites. 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 
242 

N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C-terminal targeting signal . 

amino acids 315-317 

Cytochrome c family heme-binding site signature. 

amino acids 9-14 




•i :r i 

w 



r \ 

pi 
CI 



APPJD 



=10063732 



Page 270 of 322 
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G AGCT C CCCTCAGGAGCGCGT T AGCT TC AC ACCT T C GG CAGC AGGAGG GC GGC AGCT T CT CGC AGGCG GC A 
GGGCGGGCGGCCAGGATCATGTCCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 
GGCCGGCT GC AT CG CGGCC ACCGG GATGG ACAT GTGGAGCACCC AGGACCTGT ACG AC AACCCC GTCACCT 
CC GT GT TCCAGT AC G AAGGGCTCT GG AGGAGCTGCGT GAG GCAGAGT T C AGGCT TC ACCG AAT GCAGGCCC 
TATTTCACCATCCTGGGACTTCCAGCCATGCTGCAGGCAGTGCGAGCCCTGATGATCGTAGGCATCGTCCT 
GGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTG 
. CCAAAGCCAACATGACACTGACCTCCGGGATCATGTTCATTGTCTCAGGTCTTTGTGCAATTGCTGGAGTG 
TCTGTGTTTGCCAACATGCTGGTGACTAACTTCTGGATGTCCACAGCTAACATGTACACCGGCATGGGTGG 
GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 
CI TCACACTAATTGGGGGTGTGATGATGTGCATCGCCTGCCGGGGCCTGGCACCAGAAGAAACCAACTACAAA 
2;J GCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTT 
7\ TGGGTCCAACACCAAAAACAAGAAGATATACGATGGAGGTGCCCGCACAGAGGACGAGGTACAATCTTATC 
K j C TTCC AAGCACG ACT ATGT G TAA TGC TCT AAGACCT CTCAGC AC GGGCGGAAGAAACT CC CGGAGAGC TC A 

CCCAAAAAACAAGGAGATCCCATCTAGATTTCTTCTTGCTTTTGACTCACAGCTGGAAGTTAGAAAAGCCT 
CGAT TT CAT CTT TGGAGAGGCC AAAT GGT C TT AGCC TCAGTCTCT GTCTCT AAATAT TC CACCATAAAAC A 
GC TGAGTT AT T TAT GAAT T AGAGGCT AT AGCT CAC ATT T TC AAT CCTC TATT T C TT T T T T T AAATAT AAC T 
TTCTACTCTGATGAGAGAATGTGGTTTTAATCTCTCTCTCACATTTTGATGATTTAGACAGACTCCCCCTC 
TTCCTCCTAGTCAATAAACCCATTGATGATCTATTTCCCAGCTTATCCCCAAGAAAACTTTTGAAAGGAAA 
Y J GAGTAGACCCAAAGATGTTATTTTCTGCTGTTTGAATTTTGTCTCCCCACCCCCAACTTGGCTAGTAATAA 

AC AC TT ACT G AAG AAG AAGC AAT AAG AGAAAGAT ATTT GT AATCTCT C C AGCCC AT GAT C TCGGTT TT CT T 
ACACTGTGATCTTAAAAGTTACCAAACCAAAGTCATTTTCAGTTTGAGGCAACCAAACCTTTCTACTGCTG 
TTGACATCTTCTTATTACAGCAACACCATTCTAGGAGTTTCCTGAGCTCTCCACTGGAGTCCTCTTTCTGT 
CGCGGGTCAGAAATTGTCCCTAGATGAATGAGAAAATTATTTTTTTTAATTTAAGTCCTAAATATAGTTAA 
AAT AAAT AAT GT TT T AGT AAAATG AT ACACTAT CTCTGT G AAAT AGCCTC ACC C CT ACAT GT G GAT AG AAG 
GAAATGAAAAAATAATTGCTTTGACATTGTCTATATGGTACTTTGTAAAGTCATGCTTAAGTACAAATTCC 
ATGAAAAGCT CACACC TGTAAT CCT AGC ACTT T GGGAGGCTG AGGAGG AAGG AT CACTTGAGCCCAG AAGT 
TCGAGACTAGCCTGGGCAACATGGAGAAGCCCTGTCTCTACAAAATACAGAGAGAAAAAATCAGCCAGTCA 
T GGT GGC ATACACCT GT AGT CCCAGC ATTCCGGGAGGCT GAG GT GGGAGG ATC ACT T GAGCCC AGGGAGGT 
TGGGGC TGCAGT GAGC C ATG AT CAC ACCACTGCACT CC AGCC AGGT GACAT AGC GAGAT C CT GTCT AAAAA 
AATAAAAAATAAATAATGGAACACAGCAAGTCCTAGGAAGTAGGTTAAAACTAATTCTTTAA 
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FIGURE 118 



MSTTTCQWAFLLSILGLAGCIAATGMDMWSTQDLYDNPVTSVFQYEGLWRSCVRQSSGFTECRP 
YFTILGLPAMLQAVRAIMIVGIVLGAIGLLVSIFALKCIRIGSMEDSAKANMTLTSGIMFIVSGL 
CAI AGVSVFANMLVTNFWMS TANMYTGMGGMVQTVQTRYT FGAALFVGWVAGGLTLI GGVMMC I A 
CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY 
V 

Signal peptide: 

amino acids 1-23 




Transmembrane domains : 



amino acids 81-100, 121-141, 173-194 
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GGAAAAACTGTTCTCTTCTGTGGCACAGAGAACCCTGCTTCAAAGCAGAAGTAGCAGTTCCGGAGTCC 
AGCTGGCTAAAACTCATCCCAGAGGATAATGGCAACCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 
GAAAACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA 
CATCAGGATGCAGTGCAAAATCTATGATTCCCTGCTGGCTCTTTCTCCGGACCTACAGGCAGCCAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 
ACCAGGTGCACGGGGGACAATGAGAAGGTGAAGGCTCACATTCTGCTGACGGCTGGAATCATCTTCAT 
CATCACGGGCATGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATA 
ACT CAATAGTGAATGT T GCC CAAAAACGT GAGCTT GGAGAAGCTCTCTACT TAGGATGGACCACGGCA 
CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 



g| CAGAT ACTCGATACCT T CCCATCGCACAACCCAAAAAAGTT AT CACACCGGAAAGAAGTCACC GAGCG 



TCTACTCCAGAAGTCAGTATGTGTAGTTGTGTATGTTTTTTTAACTTTACTATAAAGCCATGCAAATG 
ACAAAAATCTATATTACTTTCTCAAAATGGACCCCAAAGAAACTTTGATTTACTGTTCTTAACTGCCT 
W AAT CT TAAT TACAGGAACT GTGCATCAGCTAT TTAT GAT TCTATAAGCTAT T TCAGCAGAATGAGATA 

m 

2 n TTAAACCCAATGCTTTGATTGTTCTAGAAAGTATAGTAATTTGTTTTCTAAGGTGGTTCAAGCATCTA 

CTCTTTTTATCATTTACTTCAAAATGACATTGCTAAAGACTGCATTATTTTACTACTGTAATTTCTCC 
ACGACATAGCAT TATGTACAT AGATGAGT GTAACAT T TATATCTCACATAGAGACAT GCT TAT AT GGT 
TTTATTTAAAA.TGAAATGCCAGTCCATTACACTGAATAAATAGAACTCAACTATTGCTTTTCAGGGAA 
ATCATGGATAGGGTTGAAGAAGGTTACTATTAATTGTTTAAAAACAGCTTAGGGATTAATGTCCTCCA 
TTTAT AATGAAGATTAAAAT GAAGGCT TTAAT CAGCATT GT AAAGGAAATT GAATGGCTT T CT GATAT 
GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 
CTTGTGTATTAAATTAACATTTTTAAAACGCAGATATTTTGTCAAGGGGCTTTGCATTCAAACTGCTT 
TTCCAGGGCTATACTCAGAAGAAAGATAAAAGTGTGATCTAAGAAAAAGTGATGGTTTTAGGAAAGTG 
AAAATATTTTTGTTTTTGTATTTGAAGAAGAATGATGCATTTTGACAAGAAATCATATATGTATGGAT 
ATATTTTAATAAGTATTTGAGTACAGACTTTGAGGTTTCATCAATATAAATAAAAGAGCAGAAAAATA 
TGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 
GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 
CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 
ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 
TATACATT TATAT TAAT AAATTGTACATT TTT CTAATT 
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L 



MATHALEIAGLFLGGVGMVGTVAVTVMPQWRVSAFIENNIWFEWFWEGLWMNCVRQANIRMQCK 
IYDSLLALSPDLQAARGLMCAASVMS FLAFMMAILGMKCTRCTGDNEKVKAHILLTAGIIFIITG 
3y[WLIPVSWANAIIRDFYNSIVlWAQKRELGEALYLGWTTALVLIVGGALFCCVFCCNEKSSSY 
RYSIPSHRTTQKSYHTGKKSPSVYSRSQYV 

Signal peptide: 

amino acids 1-17 



Q Transmembrane domains: 

M amino acids 82-101, 118-145, 164-188 

ft? 

" ; 
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FIGURE 121 



GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
CTCCGGCAGAGGGAGGTGGTGGACCTGTAT7UVTGGAATGTGCTTACAAGGGCCAGCAGGAGTGCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GAT T CAAAGGAGAAAAGGGGGAATG T CTGAGGGAAAGCT T TGAGGAG TCC T GGACACCCAAC TAC 
AAGCAGT GT T CAT GGAGT TCAT T GAAT TAT GGCATAGATC T T GGGAAAAT T GCGGAGTGTACAT T 
TACAAAGATGCGTTCAAATAGTGCTCTAAGAGTTTTGTTCAGTGGCTCACTTCGGCTAAAATGCA 
GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC 
: AT T GAAGCTAT AAT T TAT T TGGACCAAGGAAGCCC T GAAATGAAT TCAACAAT TAAT AT TCATCG 

\] CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 
Ul TTGGCACTTGTTCAGATTACCCAAAAGGAGATGCTTCTACTGGATGGAATTCAGTTTCTCGCATC 

AT TAT T GAAGAACTACCAAAATAAATGC T T T AATT T TCAT T TGC T ACC T C T T T T T T TAT TAT GC C 



CI 



TTGGAATGGT TCACTTAAATGACAT TTTAAATAAGTTTATGTATACAT CT GAATGAAAAGCAAAG 
CTAAATATGT TTACAGACCAAAGTGTGAT TTCACACTGT TTTTAAATCTAGCAT TATTCAT TT T G 
**t CTTCAATCAAAAGTGGTTTCAATATTTTTTTTAGTTGGTTAGAATACTTTCTTCATAGTCACATT 



III 



CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 
AAAAAATATAAAAGCTACCAATCTTTGTACAATTTGTAAATGTTAAG7VATTTTTTTTATATCTGT 
T AAAT AAAAAT TAT T T C C AACA 
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MRPQGPAAS PQRLRGLLLLLLLQLPAPS SASEI PKGKQKAQLRQRE WDLYNGMCLQGPAGVPGR 
DGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIAECTFTK 
MRSNSALRVLFSGSLRLKCRNACGQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMNSTINIHRTS 
SVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRI IIEELPK 

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 
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GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
ATTGCCTGGAAGAATACATCATGTTTTTCGATAAGAAGAAATTGTAGGATCCAGTTTTTTTTTTA 
ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 
TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 
TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 
GC CCCTAAGCCAAAGCAAAAGACC T AAGGACGACC T TTGAACAATACAAAGGATGGGT T T CAAT G 
TAATTAGGCTACTGAGCGGATCAGCTGTAGCACTGGTTATAGCCCCCACTGTCTTACTGACAATG 
CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
AT C T CAGAAAT TACAGGAGATACCCT CAAG TAT AT C T GC T GGT T GC T T AGG T TTGTCCCTTCGCT 
AT AACAG C C T T CAAAAAC T T AAG T AT AAT CAAT T T AAAGGGC T C AAC C AGC T CAC C T GGC TAT AC 
CT TGAC CATAACCATAT CAGCAAT AT TGACGAAAATGC T T T TAATGGAAT ACGCAGAC TCAAAGA 
m GCTGATTCTTAGTTCCAATAGAATCTCCTATTTTCTTAACAATACCTTCAGACCTGTGACAAATT 
SJ TACGGAACTTGGATCTGTCCTATAATGAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 
*fl CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA 
W AGACTGCCGCAACCTGGAACTTTTGGACCTGGGATATAACCGGATCCGAAGTTTAGCCAGGAATG 
\J TCT T TGCTGGCATGATCAGACTCAAAGAAC TTCACCTGGAGCACAAT CAAT TTTCCAAGC TCAAC 

yi| CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 
qjf C AT AGGACAGACCAT GTCC TGGACC T GGAGCT CC T TACAAAGGC T TGAT T TAT CAGGCAATGAGA 

TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
T CCAACAAGC T CACAT T TATTGGT CAAGAGAT T T TGGAT TC TT GGAT AT C C CTCAATGACATCAG 
TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT 
HI TTAAAGGTCTAAGGGAGAATACAATTATCTGTGCCAGTCCCAAAGAGCTGCAAGGAGTAAATGTG 
m ATC GATGCAGTGAAGAAC TACAGCAT CT GT GGCAAAAGTAC TACAGAGAGGTT TGAT C TGGC CAG 

M% GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 
Jjjjf TGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCT 
CI TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 
f|| CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 

GGCACAGGAAAAAGAAAAGACAG T CC CT AAAGCAAAT GAC TCCCAGCACCCAGGAAT T T T AT GT A 
GAT TATAAACC CAC CAACACGGAGACCAGCGAGATGC TGC TGAATGGGAC GGGACCC T GCACCT A 
T AACAAAT C G GGC T C C AG GGAG T G T GAG G T ATGA AC CAT T G T G AT AAAAAGAG C T C T TAAAAGC T 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
GGTCATT TTCCTCTCATACATAAT CAACCCAT TGAAATT TAAATACCACAATCAATGTGAAGCT T 
GAACTCCGGTTTAATATAATACCTATTGTATAAGACCCTTTACTGATTCCATTAATGTCGCATTT 
GTTTTAAGATAAAACTTCTTTCATAGGTAAAAAAAAAAA 
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FIGURE 124 

MGFNVIRLLSGSAVALVIAPTVLLTMLSSAERGCPKGCRCEGKMVYCESQKLQEIPSSISAGCLG 
LS LRYNS LQKLK YNQ FKGLNQLTWL YLDHNH I SN I DENAFNG I RRLKEL I LS SNRI S Y FLNNT FR 
PVTNLRNLDLSYNQLHSLGSEQFRGLRKLLSLHLRSNSLRTIPVRI FQDCRNLELLDLGYNRIRS 
LARNVFAGMIRLKELHLEHNQFSKLNLALFPRLVSLQNLYLQWNKISVIGQTMSWTWSSLQRLDL 
SGNEIEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQEILDSWISLNDISLAGNIWECSRNICSLVN 
WLKS FKGLRENT 1 1 CAS PKELQGVNVI DAVKNYS ICGKS T TERFDLARALPKPT FKPKLPRPKHE 
SKPPLPPTVGATEPGPETDADAEHISFHKIIAGSVALFLSVLVILLVIYVSWKRYPASMKQLQQR 
SLMRRHRKKKRQSLKQMTPSTQEFYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

Transmembrane domain: 

amino acids 420-442 

N-glycosylation sites. 

amino acids 126-129, 357-360, 496-499, 504-507 

cAMP- and cGMP -dependent protein kinase phosphorylation site. 

amino acids 465-468 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
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CCGTTATCGTCTTGCGCTACTGCTGAATGTCCGTCCCGGAGGAGGAGGAGAGGCTTTTGCCGCTG 
ACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACGGTGGCCGAGCT 
AGCAACCTTTCCCCTGGATCTCACAAAAACTCGACTCCAAATGCAAGGAGAAGCAGCTCTTGCTC 
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCCCTAGGGATC 
AT T GAAGAGGAAGGC T TT CTAAAGC T TT GGCAAGGAGTGACACCCGCCAT T T ACAGACACGTAGT 
GTATTCTGGAGGTCGAATGGTCACATATGAACATCTCCGAGAGGTTGTGTTTGGCAAAAGTGAAG 
ATGAGCATTATCCCCTTTGGAAATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 
T T AGCCAATCCAAC T GAC CTAGTGAAGGT T CAGATGCAAATGGAAGGAAAAAGGAAAC TGGAAGG 
AAAACCATTGCGATTTCGTGGTGTACATCATGCATTTGCAAAAATCTTAGCTGAAGGAGGAATAC 
6! GAGGGCT T T GGGCAGGC T GGG T ACCCAAT AT ACAAAGAGC AGC AC TGGT G AAT AT GGGAGAT T T A 

AC CAC T TATGAT AC AGTGAAACACT ACT T GGT AT TGAATACACCAC T T GAGGAC AAT AT C ATGAC 
TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
T CAAAAGC AGAATAAT GAAT C AACCACGAGATAAAC AAGGAAGGGG AC T T T T GT ATAAATC ATCG 
ACTGACTGCT TGAT TCAGGCTGT TCAAGGTGAAGGAT TCATGAGTCTATATAAAGGC TTTT TACC 



Is: I 



33 ATCTTGGCTGAGAATGACCCCTTGGTCAATGGTGTTCTGGCTTACTTATGAAAAAATCAGAGAGA 

T GAGTGGAGT CAGTCCAT T TTAA 

» 

ST; 

81 

5ftKf? 

hi 
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FIGURE 126 

MSVPEEEERLLPLTQRWPRASKFLLSGCAATVAELATFPLDLTKTRLQMQGEAALARLGDGARES 
AP YRGMVRT ALG 1 1 EEE G FLKLWQGVT PAI YRHVVYSGGRMVT YE HLREVVFGKS EDEH YPLWKS 
VI GGMMAGVI GQFLANPTDLVKVQMQMEGKRKLEGKPLRFRGVHPiAFAKI LAEGG IRGLWAGWVP 
NIQRAALVNMGDLTTYDTVKHYLVLNT PLEDNIMTHGLS SLCSGLVAS I LGTPADVIKSRIMNQP 
RDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLTYEKIREMSGVSPF 

Transmembrane domains: 

amino acicls 25-38, 130-147, 233-248 
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FIGURE 127 



CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATGGCCAGGCCCGGCATGG 
AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 
CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GCTGGCTGCTGAATGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCCTACAGATGTGACCTAT 
CAAATGAAGAGGACATCCTCTCCATGTTCTCAGCTATCCGTTCTCAGCACAGCGGTGTAGACATC 
TGCATCAACAATGCTGGCTTGGCCCGGGCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 
GGACATGTTCAATGTGAACGTGCTGGCCCTCAGCATCTGCACACGGGAAGCCTACCAGTCCATGA 
AGGAGCGGAATGTGGACGATGGGCACATCATTAACATCAATAGCATGTCTGGCCACCGAGTGTTA 
CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 
m GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 

AGACACAATTCGCCTTCAAACTCCACGACAAGGACCCTGAGAAGGCAGCTGCCACCTATGAGCAA 
ATGAAGTGTCTCA7\ACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 
CATCCAGATTGGAGACATCCAGATGAGGCCCACGGAGCAGGTGACC TAGT GACTGTGGGAGCTCC 
TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 



m 



mi 
M 

m 



fl CACGGGATACCACTTCCTGTCCACACCCCGACCAGGGGCTAGAAAATTTGTTTGAGATTTTTATA 

y'l TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 

W TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCAGTTGGCCTTTGTCTGCTCTCAGTG 

Q 



TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 
f i; : TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 

AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 
CAGAACACCAGGGCC TGGCCCAGT GGAT T TCATGGT GATCATT AAAAAAGAAAAATCGCAACCAA 
AAAAAAAAAA 
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FIGURE 128 



if** 

5! 



Mi- 

9:1 1 



pi 

«1 

EI 
81 

ni 



MARPGMERWRDRLALVTGAS GG I GAAVARALVQQGLKWGCART VGN IEE LAAECKSAGYPGTL I 
PYRCDLSNEEDILSMFSAIRSQHSGVDICINNAGLARPDTLLSGSTSGWKDMFWNVLALSICTR 
EAYQSMKERNVDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC 
ISPGWETQFAFKLHDKDPEKAAATYEQMKCLKPEDVAEAVIYVLSTPAHIQIGDIQMRPTEQVT 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

N-myristoylation sites. 

amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 



£\ 199-205 



Short-chain alcohol dehyrogenase . 

amino acids 30-42, 104-114 
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FIGURE 12Q 



AACTTCTACATGGGCCTCCTGCTGCTGGTGCTCTTCCTCAGCCTCCTGCCGGTGGCCTACACCAT 
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
CCGTTGAGTGCTTGAAATGAGGAACTGAGAAAATTAATTTCTCATGTATTTTTCTCATTTATTTA 
T TAATT T T T AACT GATAGT TGTACAT AT TT GGGGGTACATGTGAT AT T T GGATACAT GTATACAA 
T ATATAAT GATCAAATCAGGGT AACT GGGATATCGAT CACATCAAACAT T TAT T T T T TAT TCT T T 
T TAGACAGAGT CTCAC T C T GT CAC GCAGGCTGGAG T GCAGTGGT GCCAT C TCAGCT TACTGCAAC 
CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 
GCACCACAATGCCCAACTAATTTTTGTATTTTTAGTAGAGACGGGGTTTTGCCATGTTGCCCAGG 



%i CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 
111 GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 
I *f ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 



ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 
AACCTCTGATCACCTCATTCTACTCTCTACCTCCATGAGATCCACTTTTTTAGCTCCCACATGTG 



AGTAAGAAAATGCAATATTTGTCTTTCTGTGCCTGGCTTATTTCACTTAACATAATGACTTCCTG 
K| TTCCATCCATGTTGCTGCAAATGACAGGATTTCGTTCTTAATTTCAATTAAAATAACCACACATG 



GCAAAAA 

PI 
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FIGURE ISO 



MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHGTLGE S PMP FKRVFCQDGNVRS FCVCAVHFS S HQPPVAVECLK 

Important features of the protein: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site. 




amino acids 86-92 



Zinc carboxypeptidases , zinc-binding region 2 signature. 



amino acids 68-79 
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FIGURE 131 



TT CTGAAGT AAC GGAAGCT ACCTT GT AT AAAG AC CT C AAC ACTGCT GACC ATGATCAGCGCAGC CT GGAGC 
ATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTCTATCAGTTATGGCTAAATCCTG 
TCCATCTGTGTGTCGCTGCGATGCGGGTTTCATTTACTGTAATGATCGCTTTCTGACATCCATTCCAACAG 
GAAT AC C AGAGG AT GCT ACAACT C TC TACC TTCAGAAC AAC C AAAT AAAT AATGCT GG G ATT CC TTCAGAT 
TT GAAAAACTTGCT GAAAGT AGAAAG AAT ATACCT ATACC AC AACAGT T TAG AT GAAT T T CCT AC C AACCT 
CCCAAAGT AT GT AAAAGAGT TAC ATT T GC AAGAAAAT AAC AT AAGGACT AT C AC TT AT G ATT CACTT T CAA 
AAATTCCCTATCTGGAAGAATTACATTTAGATGACAACTCTGTCTCTGCAGTTAGCATAGAAGAGGGAGCA 
TTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTAATCACCTTAGCACAATTCCCTGGGGTTT 
hk GCCC AGGACT AT AG AAGAAC T ACG CT TGGATG AT AATCGC AT AT CC AC T ATT T CAT C ACCAT CTCTTC AAG 

I*:? GT CT C ACT AGT C T AAAACGC CT GGT T CT AGAT GG AAACCTGT TGAAC AAT C ATGGT TT AG GT GAC AAAGT T 

h? TTCTTCAACCTAGTTAATTTGACAGAGCTGTCCCTGGTGCGGAATTCCCTGACTGCTGCACCAGTAAACCT 

m 

fl TCCAGGC AC AAACCTG AGGAAGCT TT AT CT TCAAGATAAC C AC ATCAATC GGGT GCCCCC AAAT GCT T TT T 

f\ CTTATCTAAGGCAGCTCTATCGACTGGATATGTCCAATAATAACCTAAGTAATTTACCTCAGGGTATCTTT 

GAT GAT TT GGAC AATAT AAC AC AACT GATT CT T C GC AACAAT CCCTGGTATTGC GGGTGC AAGATG AAAT G 
GGTACGTGACTGGTTACAATCACTACCTGTGAAGGTCAACGTGCGTGGGCTCATGTGCCAAGCCCCAGAAA 
g AGGTTCGTGGGATGGCTATTAAGGATCTCAATGCAGAACTGTTTGATTGTAAGGACAGTGGGATTGTAAGC 
|;!| ACCATTCAGATAACCACTGCAATACCCAACACAGTGTATCCTGCCCAAGGACAGTGGCCAGCTCCAGTGAC 
Iff CAAACAGCCAGAT ATT AAGAACCCC AAG CT CACT AAGGAT C AAC AAACC AC AGGGAGT CCCT CAAG AAAAA 

C| CAAT TAC AAT TACT GT GAAGTCTGTC AC CT CTGAT ACC ATTC AT ATCT CT TGG AAACTT GCT C T ACCT ATG 

I£f ACTGCTTTGAGACTCAGCTGGCTTAAACTGGGCCATAGCCCGGCATTTGGATCTATAACAGAAACAATTGT 

AACAGGGGAACGCAGTGAGTACTTGGTCACAGCCCTGGAGCCTGATTCACCCTATAAAGTATGCATGGTTC 
CCATGGAAACCAGCAACCTCTACCTATTTGATGAAACTCCTGTTTGTATTGAGACTGAAACTGCACCCCTT 
CGAATGTACAACCCTACAACCACCCTCAATCGAGAGCAAGAGAAAGAACCTTACAAAAACCCCAATTTACC 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
TT CAT AGG AATGGAT C GC TCTT CT C AAGGAACTGTG CAT AT AGC AAAGGG AG GAGAAG AAAGGATGAC TAT 
GC AG AAGCT GGC ACT AAG AAGGAC AAC T CT ATCC TGGAAAT C AGGGAAACT T C T TT T C AGAT GT TACC AAT 
AAGCAATGAACCCATCTCGAAGGAGGAGTTTGTAATACACACCATATTTCCTCCTAATGGAATGAATCTGT 
ACAAAAACAATCACAGTGAAAGCAGTAGTAACCGAAGCTACAGAGACAGTGGTATTCCAGACTCAGATCAC 
T C AC AC TC ATGA TGCT GAAGGACT CAC AGC AGACTT GTGT TT TGGG TT TT TT AAACCT AAGGG AGGT G ATG 
GT 



ru 



f i! 
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FIGURE tag 

MISAAWSIFLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGFIYCNDRFLTSIPTGIPEDATTLYL 
QNNQINNAGIPSDLKNLLKVERIYLYHNSLDEFPTNLPKYVKELHLQENMIRTITYDSLSKIPYL 
EELHLDDNSVSAVSIEEGAFRDSNYLRLLFLSRNHLSTIPWGLPRTIEELRLDDNRISTISSPSL 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFSYLRQLYRLDMSNNNLSNLPQGIFDDLDNITQLILRNNPWYCGCKMKWVRDWLQSLPV 
KVNVRGLMCQAPEKVRGMAIKDLNAELFDCKDSGIVSTIQITTAIPNTVYPAQGQWPAPVTKQPD 
I KNPKLTKDQQT TGS PS RKT I T I TVKS VT SDT IH I S WKLAL PMTALRLS WLKLGHS PAFGS I TE T 
IVTGERSEYLVTALEPDSPYKVCMVPMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKE 
P YKNPNLPLAAI I GGAVALVT I ALLALVCWYVHRNGS L FSRNCAYSKGRRRKDDYAEAGTKKDNS 
I LE IRETS FQMLP I SNEP I SKEEFVIHT I FPPNGMNLYKNNHSESSSNRSYRDSGI PDSDHSHS 

Important features of the protein: 
Signal peptide: 

amino acids 1-28 
Transmembrane domain: 
amino acids 531-552 
N-glycosylation sites. 

amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 
Tyrosine kinase phosphorylation site, 
amino acids 515-522 
N-myristoylation sites. 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Amidation site. 

amino acids 567-570 
Leucine zipper pattern. 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-44 
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FIGURE 133 



few? 



m 

H 



CCGTCATCCCCCTGCAGCGACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTCATGTCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
TGCAGAGGCAGTCTGGGCTTGGCCAGAGCTCAGGGTGCTGAGCGTGTGACCAGCAGTGAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGC TGGAGAGAGCCACCGTCT T CCTAGAACAGAGGC TGCC T GAAAT CAACC TGGAT GGCATGGT 
GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
s . TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 
Q CTCCACTACCTCAAGCTGAGTGATCCCAAGTACCTAAGAGAGTTCCAGCTGACCCTCCAGCCCGG 

GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 
CCCAGGACTCATTCTCAGAGGAGAGAAGTGACGTGTGCCTGGTGCAGCTGCTGGGAACCGGGACG 
GACAGCAGCGAGCCC TGCGGCCT CTCAGACCTC TGCAGGAGCCTCATGACCAAGCCCGGCTGCT C 
AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 
111 GACCACTCCAACAGAGCCAGGACTATATCAACC TC TTCTGCGCCAACATGAT GGACT TGAACCGC 

* AGAGCTGAGGCCATCGGATACGCCTACCCTACCCGGGACATCTTCATGGAAAACATCATGTTCTG 

TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 
W AACAGC AG G AAGGAT GCTTCGGG G AGG C T GAT G C T GAAGAT GAAGAAT TATCTAAAGCTAT TCAA 

Q| TAT CAGC AGCAT T T TTCGAGGAGAGT GAAGAGGCGAGAAAAACAAT TTCCAGATTCTCGGTC TGT 

C| TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 
i M AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 

T T T TAT AT T T T T T TAGTAGAGACAGG GT T T CATCATGT TGCTCATGC T GGT CTCGAACTCC T GAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATT ATAGG TGTGAGGCACCGTGTCTG 
GC T GAAAAGCAC T T T CAAAGAGACT GT GT T GAAT AAAGGGCCAAGGT T CT T GCCACCCAGCAC T C 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
G T GGC T TCCT ATACATCC TGGCAGAAT AC CC CCCAGCAAACAGAGAGCCACACC CATCCACACCG 
CCACCACCAAGCAGCCGCTGAGACGGACGGTTCCATGCCAGCTGCCTGGAGGAGGAACAGACCCC 
TTTAGTCCTCATCCCTTAGATCCTGGAGGGCACGGATCACATCCTGGGAAGAAGGCATCTGGAGG 
ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
GGCCGGGAGGGACCCAGGTGTGAACGGATGAATAAAGTTCAACTGCAACTGAAAAAAAAAAA 
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FIGURE 134 



MSARGRWEGGGRRACRGSLGLARAQG 

TAESKATIADLILSALERATVFLEQRLPE INLDGMVGVRVLEEQLKSVREKWAQEPLLQPLSLRV 
GMLGEKLEAAIQRSLHYLKLSDPKYLREFQLTLQPGFWKLPHAWIHTDASLVYPTFGPQDSFSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
YINLFCANMMDLNRRAEAIGYAYPTRDIFMENIMFCGMGGFSDFYKLRWLEAILSWQKQQEGCFG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
SSWDYRSVPPYLANFYIFLVETGFHHVAHAGLELLISRDPPTSGSQSVGL 



Important features of the protein: 



Signal peptide: 



amino acids 1-26 



Transmembrane domain: 



amino acids 39-56 




Tyrosine kinase phosphorylation sites . 

amino acids 149-156, 274-282 



N-myristoylation sites. 

amino acids 10-16, 20-26, 63-69, 208-214 



Amidation site. 



amino acids 10-14 



Glycoprotein hormones beta chain signature 1. 



amino acids 230-237 
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FIGURE 135 



GGTCTGAGTGCAGAGCTGCTGTC ATG GCGGCCGCTCTGTGGGGCTTCTTTCCCGTCCTGCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGAT GGGAG T T T TGT GGT T CAT GAT ATACC T TC T GGAT CT TAT GT AG TGGAAGTT G TAT C TCC 
AGCTTACAGATTTGATCCCGTTCGAGTGGATATCACTTCGAAAGGAAAAATGAGAGCAAGATATG 
TGAATTACATCAAAACATCAGAGGTTGTCAGACTGCCCTATCCTCTCCAAATGAAATCTTCAGGT 
GCACCTTCTTACTTTATTAAAAGGGAATCGTGGGGCTGGACAGACTTTCTAATGAACCCAATGGT 
TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC 
CTGAGATGAGACGGGAAATGGAGCAGT CAAT GAATATGC TGAAT TCCAACCATGAGT TGCC TGAT 
GT TTCT GAGT TCATGACAAGACT CT T CT C TT CAAAAT CATC TGGCAAATCTAGCAGCGGCAGCAG 
TAAAACAGGCAAAAGTGGGGCTGGCAAAAGGAGG TAG TCAGGCCGTCCAGAGCTGGCATTTGCAC 
AAACACGGCAACACTGGGTGGCATCC^AGTCTTGGAAAACCGTGTGAAGCAACTACTATAAACTT 



TH GAGTCATCCCGACGTTGATCTCTTACAACTGTGTATGTT 



AACTTTTTAGCACATGTTTTGTACTTGGTACACGAGAAAACCCAGCTTTCATCTTTTGTCTGTAT 



w GAGGT CAATAT TGATG TCAC T GAATT AAT T ACAGT G TCC TATAGAAAAT GCCAT TAATAAAT TAT 

111 



ATGAACTACTATACAT TAT GTATATT AAT T AAAACATC T TAATCCAGAAATCAAAAAAAAAAAAA 
m AAAAAAAAAAAAAAA 
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FIGURE 136 



MAAALWGFFPVLLLLLLSGDVQSSEVPGAAAEGSGGSGVGIGDRFKIEGRAVVPGVKPQDWISAA 
RVLVDGEEHVGFLKTDGS FWHDI PSGS YWEWS PAYRFDPVRVDI TSKGKMRARYVNYIKTSE 
VVRLPYPLQMKSSGPPSYFIKRESWGWTDFLMNPMVMiyny^ 
QSMNMLNSNHELPDVSEFMTRLFSSKSSGKSSSGSSKTGKSGAGKRR 

Important features of the protein: 
Signal sequence: 

amino acids 1-23 

Transmembrane domain: 

amino acids 161-182 



Vj. N-glycosylation site. 

amino acids 184-187 



p*,; Glycosaminoglycan attachment sites. 

iii amino acids 37-40, 236-239 



\i cAMP- and cGMP-dependent protein kinase phosphorylation site, 

amino acids 151-154 



N-myristoylation sites. 

amino acids 33-38, 36-41, 38-44, 229-234 

Amidation site. 

amino acids 238-241 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 229-236 
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FIGURE 137 



GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTAT ATG CGTCAATTCCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACGACCTGTGC 
CACCAACTCGCACTCAGACTCTGAACTCAGACCTGAAATCTTCTCTTCACGGGAGGCTTGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CTTCTCTCTGCTGGGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 
AAGC T GT GT GAT CGC CACAAACC T T CAGGAAATACGAAAT GGATTT T C T GAGATACGGGGCAGT G 

0 TGCAAGCCAAAGATGGAAA.CATTGACATCAGAATCT TAAGGAGGACT GAGT CTTTGCAAGACACA 

■ Q AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 

¥ ? AAACTACCAGACCCCTGACCATTATACTCTCCGGAAGATCAGCAGCCTCGCCAATTCCTTTCTTA 



■Si 



CCAT CAAGAAGGAC CTCCGGC TC TC TCAT GCCCACAT GACAT GCCAT T GT GGGGAGGAAGCAAT G 
AAGAAATACAGCCAGATTCTGAGTCACTTTGAAAAGCTGGAACCTCAGGCAGCAGTTGTGAAGGC 
TTTGGGGGAACTAGACATTCTTCTGCAATGGATGGAGGAGACAGAATAGGAGGAAAGTGATGCTG 
SR C TGCT AAGAATATT CGAGG TCAAGAGCT CCAGT CTT CAATACCT GCAGAGGAGGCAT GACCCCAA. 

W ACCACCAT C T CT TTAC TGTACT AGT C T T GT GC T GGT CACAGT GT AT C T TAT T TATGCAT TAC TT G 

%l CTTCCTTGCATGATTGTCTTTATGCATCCCCAATCTTAATTGAGACCATACTTGTATAAGATTTT 
m» TGTAATATCTTTCTGCTATTGGATATATTTATTAGTTAATATATTTATTTATTTTTTGCTATTTA 



ATGTATTTATT TTTT TACT TGGACAT GAAACT TTAAAAAA&TTCACAGATTATAT TTATAACCTG 



fl| ACTAGAGCAGGTGATGTATTTTTATAGAGTAAAAAAAAAAAACCTTGTAAATTCTAGAAGAGTGG 

CTAGGGGGGTTATTCATTTGTATTCAACTAAGGACATATTTACTCATGCTGATGCTCTGTGAGAT 
ATTTGAAATTGAACCAATGACTACTTAGGATGGGTTGTGGAATAAGTTTTGATGTGGAATTGCAC 
ATCTACCTTACAATTACTGACCATCCCCAGTAGACTCCCCAGTCCCATAATTGTGTATCTTCCAG 
CCAGGAAT CCTACAC GGGCAGCAT GT AT T T CTACAAAT AAAGT T T T CT T TGCATACCAAAAAAAA 
AAAAAAAAAAA 
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FIGURE 138 



if! 



0 

Hi? 



MRQFPKTS FDI SPEMS FS I YSLQVPAVPGLTCWALTAE PGWGQNKGATTCATNSHSDSELRPE I F 

S SREAWQ FFLLLWS PDFRPKMKAS S LAFS LLS AAFYLLWT PSTGLKTLNLGSCVI ATNLQE I RNG 

FSEIRGSVQAKDGNIDIRILRRTESLQDTKPANRCCLLRHLLRLYLDRVFWYQTPDHYTLRKIS 

SIANSFLTIKKDLRLSHAHMTCHCGE 

E 

Important features of the protein: 
Signal peptide: 



H amino acids 1-42 

£3 

n . 

^ cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

. •. amino acids 192-195, 225-228 

M 

l&r N-myristoylation sites. 



amino acids 42-47, 46-51, 136-141 
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FIGURE 13Q 



CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACCTCTCTTCAGTAAAGTTGTTATTGTTCTGATAGATGCCTTGAGAGATGATTTTGTGTTTG 
GGTCAAAGGGTGTGAAATTTATGCCCTACACAACTTACCTTGTGGAAAAAGGAGCATCTCACAGT 
T T T GT GGCT GAAGCAAAGCCACCTACAG T TAC TAT GCC TC GAAT CAAGGCATT GATGACGGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 
TAAGACAAGCAAAAGCAGCTGGAAAAAGAATAGTC T T T TATGGAGAT GAAACC TGGGT TAAAT TA 
T TCCCAAAGCAT T T T GT GGAAT AT GATGGAACAACC TCAT T T T T CG T G TCAGAT TACACAGAGGT 
GGATAATAATGTCACGAGGCATTTGGATAAAGTATTAAAAAGAGGAGATTGGGACATATTAATCC 
TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG 
AAGC TGAGCGAGATGGACAGCGTGC T GAT GAAGATCCACACCTCAC T GCAGT CGAAGGAGAGAGA 
F% GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 

GGGCCTCCTCCACCGAGGAGGTGAATACACCTCTGATTTTAATCAGTTCTGCGTTTGAAAGGAAA 
W CCCGGT GATATC CGACATCCAAAGCACGT CCA ATAGA CGGATG T GGC T GCGACAC TGGCGATAGC 

ACTTGGCTTACCGATTCCAAAAGACAGTGTAGGGAGCCTCCTATTCCCAGTTGTGGAAGGAAGAC 
CAATGAGAGAGCAGT TGAGATTT TTACAT T TGAATACAGTGCAGCT TAGTAAACTGT TGCAAGAG 
AATGTGCCGTCATATGAAAAAGATCCTGGGTTTGAGCAGTTTAAAATGTCAGAAAGATTGCATGG 
GAACTGGATCAGACTGTACTTGGAGGAAAAGCATTCAGAAGTCCTATTCAACCTGGGCTCCAAGG 
TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 
111 TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 
n CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 
f | CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 
JS GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 
SI AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG 

TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 
CAAGGTGATTGTAAAGAGCTGGCGGTCACAGAGGAACAAGCCCCCCAGCTGAGGGGGTGTGTGAA 
T CGGACAGCCT CCCAGCAGAGG T GT GGGAGCTGCAGC TGAGGGAAGAAGAGACAAT CGGCC TGGA 
CACTCAGGAGGGTCAAAAGGAGACTTGGTCGCACCACTCATCCTGCCACCCCCAGAATGCATCCT 
GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 
ACTCAGATCCACAGAGCCCAGGATCAAGGGACCCACTGCAGTGGCAGCAGGACTGTTGGGCCCCC 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
CTGACCGAGACACTCACAGCTTTGTCATCAGGGCACAGGCTTCCTCGGAGCCAGGATGATCTGTG 
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
CTGCACACAGTATGTAGTTACCAAAAGAATAAACGGCAATAATTGAGAAAAAAAA 



APP_ID=1 0063732 



Page 293 of 322 



FIGURE 14Q 



MRLGSGTFATCCVAIEVLGIAVFLRGFFPAPVRSSARAEHGAEPPAPEPSAGASSNWTTLPPPLF 
SKWIVLIDALRDDFVFGSKGWFMPYTTYLVEKGASHSFVAEAKPPTVTMPRIKALMTGSLPGF 
VDVI RNLNS PALLEDS VI RQAKAAGKRI VFYGDETWVKL FPKH FVE YDGTT S FFVS DYTE VDNNV 
TRHLDKVLKRGDWDILILHYLGLDHIGHISGPNSPLIGQKLSEMDSVLMKIHTSLQSKERETPLP 
NLLVLCGDHGMSETGSHGASSTEEVNTPLILISSAFERKPGDIRHPKHVQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-34 



Transmembrane domain: 



amino acids 58-76 




N-glycosylation sites. 



amino acids 56-60, 194-198 




N-myristoylation sites. 

amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 




275-281, 278-284 



Ami da ti on site/ 



amino acids 154-158 



Cell attachment sequence. 



amino acids 205-208 
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FIGURE 141 



GGCACGAGGCAAGCCTTCCAGGTTATCGTGACGCACCTTGAAAGTCTGAGAGCTACTGCCCTACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTG ATG TTACTGCTGCTGTTGGAGTACAACT 
T CC C TAT AGAAAACAAC T GCC AGCACC T T AAGACC AC TCACACC T T C AGAG T GAAGAAC TTAAAC 
CCGAAGAAATTCAGCATTCATGACCAGGATCACAAAGTACTGGTCCTGGACTCTGGGAATCTCAT 
AGCAGT T CCAGATAAAAAC TACATAC GCCCAGAGAT CT T C T T T GCAT TAGCCT CATCCT T GAGCT 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
TGTGACAAGGATAAAGGACAAAGTCATCCATCCCTTCAGCTGAAGAAGGAGAAACTGATGAAGCT 
GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 
ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 
GTTGGGGTGACAGATAAATTTGAGAACAGGAAACACATTGAAT T TTCATT TCAACCAGTT TGCAA 
AGCTGAAATGAGCCCCAGTGAGGTCAGCGATTAGGAAACTGCCCCATTGAACGCCTTCCTCGCTA 

ATTTGAACTAATTGTATAAAAACACCAAACCTGCTCACT 



APP_ID=1 0063732 



Page 295 



FIGURE 142 

MLLLLLEYNFPIENNCQHLKTTHTFRVKNLNPKKFSIHDQDHKVLVLDSGNLIAVPDKNYIRPEI 
FFALASSLSSASAEKGSPILLGVSKGEFCLYCDKDKGQSHPSLQLKKEKLMKLAAQKESARRPFI 
FYRAQVGS WNMLE S AAHPGWFI CTS CNCNEPVGVTDKFENRKH I E FS FQPVCKAEMS P SE VSD 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 33-36 

N-myristoylation site. 

amino acids 50-55, 87-92 

Interleukin-1 

amino acids 37-182. 
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FIGURE 143 



CT AGAGAGTATAGGGCAGAAGGAT GGCAGAT GAGT GAC T CCACATC CAGAGCTGCCTCCCT T TAA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAG'GTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGACTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA 
CAGT GGAG T TTGC TG TCCACACATT CAAC CAACAGAGCAAGGACTAC TATGCCTACAGACT GGGG 
CACATCTTGAATTCCTGGAAGGAGCAGGTGGAGTCCAAGACTGTATTCTCAATGGAGCTACTGCT 
GGGGAGAACTAGGTGTGGGAAAT T T GAAGACGACAT T GACAAC T GCCATT T CCAAGAAAGCACAG 
AGC T GAACAATAC T T T CAC C T GCT T CTT CACCAT CAGCACCAGGC CCT GGAT GACTCAGT T CAGC 



M C TCCT GAACAAGACC T GC TTGGAGGGAT T C CAC TGAGTGAAACCCACT CACAGGCT TGTCCATGT 

m 

0- 



GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT 
TTGGACTTGTTTGTTATCCTATTTTGCATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC 
y AC ACAT CT T GAGCC TAAT CATG TAGT G T AGAT CAT TAAAC AT CAGCAT T T T AAGAAAAAAAAAAA 

111 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA?\AAAA^ 



m 

Hi? 



APP_ID=1 0063732 



Page 297 of 322 



FIGURE \a± 



s :: j 



a 
m 



MLGLPWKGGLSWALLLLLLGSQILLIYAWHFHEQRDCDEHNVMARYLPATVEFAVHTFNQQSKDY 
YAYRLGHILNSWKEQVESKTVFSMELLLGRTRCGKFEDDIDNCHFQESTELNNTFTCFFTISTRP 
WMTQFSLLNKTCLEGFH 

Important features of" the protein: 
Signal peptide: 

amino acids 1-25 



|4 N-glycosylation sites. 

amino acids 117-121, 139-143 



N-myristoylation site. 

amino acids 9-15 
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FIGURE 145 



CTGTGCAGCTCGAGGCTCCAGAGGCACACTCCAGAGAGAGCCAAGGTTCTGACGCGATGAGGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCATCAAGCACAGAATCAAGTGGAACCGGAAGGCCCTGCCCAGCACTGCCCAGATCA 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCAAGCAAGGCCGCAAGCTCGACATT 
GACT T CGGAGCCGAGGGCAACAGGTACTAC GAGGCCAAC TACT GGCAGTT CCCCGATGGCAT CCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
CCCAGGCGGCGAACCAGGGGGAGTTCCAGAAGCCAGACAACAAGCTCCACCAGCAGGTGCTCTGG 
CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 
mi AAGCTTGCCAGGAGGCTGGCAGTACAGAGCGCAGCAGCGAGCAAATCCTGGCAAGTGACCCAGCT 

tnf . 

CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 
pi . TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 
U! CCGT CAC T CCAT T C TCAGCCCC T AGCAGAGCGT C TGGCACACTAGAT TAGT AGT AAAT GCT TGAT 

GAGAAGAACACATCAGGCACTGCGCCACCTGCTTCACAGTACTTCCCAACAACTCTTAGAGGTAG 
GT GT AT T CCCGT TT T ACAGATAAGGAAACT GAGGCCCAGAGAGC T GAAGTAC TGCACCCAGCATC 
ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT 



C| GTCCAATTCCAGAGCTGTCTGGTGATCACTTTATGTCTCACAGGGACCCACATCCAAACATGTAT 



CT CT AAT GAAAT TG T GAAAGC TC CAT GT T TAGAAATAAATGAAAACACC TGA 
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FIGURE 146 

MRKHLS WWWLATVCMLLFSHLSAVQTRG IKHR IKWNRKALPSTAQ I TEAQVAENRPGAFI KQGRK 
LDIDFGAEGNRYYEANYWQFPDGIHYNGCSEANVTKEAFVTGCINATQAANQGEFQKPDNKLHQQ 
VLWRLVQELCS LKHCEFWLERGAGLRVTMHQPVLLCLLAL IWLMVK 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 157-171 

N-glycosylation sites. 

amino acids 98-102, 110-114 

Tyrosine kinase phosphorylation site. 

amino acids 76-83 

N-myristoylation sites. 

amino acids 71-77, 88-94, 93-99, 107-113, 154-160 

Amidation site. 

amino acids 62-66 
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FIGURE 147 



GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCTGATGATTTATAGACTCAAAGAAAACTCATGTTCAGAAGCTCTCTTCTCTTCTGGCCTCCTCT 
CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
CAACTTTCAGATTCAGGGGGTACATGTGAAGGTTTGTTTTATGAGTATATTGCATGATGCTGAGG 
TTTGGGGT 



0! 

H 

w 



m 

Wis 

HI 
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FIGURE 148 



MFRSSLLFWPPLCLLSLFLLILISSIYSESCKLEIFHFACQWGRSLSL'SFYFLKFQLSDSGGTCE 
GLFYEYIA 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-myristoylation site. 

amino acids 62-68 

U 
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FIGURE 14Q 



GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCTACAGCTCCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCCGAGCGTGGAAGAATGGGGTT 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACAAATCTCTACATAATAGAGAATTAAGTGCAGAAAGACCTTTGAATGAACAGATTGCTGAA 
GCAGAAGAAGACAAGAT TAAAAAAACATAT C CT CCAGAAAACAAGCCAGGTC AGAGCAACTAT T CTT T 
TGTTGATAACTTGAACCTGCTAAAGGCAATAACAGAAAAGGAAAAAATTGAGAAAGAAAGACAATCTA 
TAAGAAGCTCCCCACTT GATAATAAGT TGAATGT GGAAGATGT TGATT CAACCAAGAAT C GAAAACT G 
CI ATCGATGAT TATGACTCTACTAAGAGT GGATT GGAT CAT AAAT T T CAAGAT GAT CCAGAT GGT CT TCA 

Cl TCAACTAGACGGGACTCCTTTAACCGCTGAAGACATTGTCCATAAAATCGCTGCCAGGATTTATGAAG 

gi 

fj. AAAAT GACAGAGCCGTGTTT GACAAGAT T GTT T CTAAACTACT TAATCT CGGC CTTATCACAGAAAGC 

Z* \ CAAGCACATACACTGGAAGATGAAGTAGCAGAGGTT TTACAAAAATTAATCT CAAAGGAAGCCAACAA 

'■•ii 

f e | T TAT GAGGAGGAT CCCAATAAGCC CACAAGCTGGACTGAGAAT CAGGCT GGAAAAAT ACCAGAGAAAG 

f|J TGACTCCAATGGCAGCAATTCAAGATGGTCTTGCTAAGGGAGAAAACGATGAAACAGTATCTAACACA 

* T TAACCT TGACAAAT GGCTT GGAAAGGAGAAC TAAAACCTACAGT GAAGACAACTTT GAGGAACTCCA 

fatf ATAT T TC CCAAAT TT CTATGCGCT ACT GAAAA.GTAT TGAT TCAGAAAAAGAAGCAAAAGAGAAAGAAA 

In 

CACTGAT TACTAT CATGAAAACACTGATT GACT TTGTGAAGAT GATGGT GAAATATGGAACAATATCT 
% CCAGAAGAAGGTGTTTCCTACCTTGAAAACTTGGATGAAATGATTGCTCTTCAGACCAAAAACAAGCT 
55 AGAAAAAAATGCTACTGACAATATAAGCAAGCTTTTCCCAGCACCATCAGAGAAGAGTCATGAAGAAA 



f|J CAGACAGT ACCAAGGAAGAAGCAGCTAAGATGGAAAAGGAATATGGAAGCTT GAAGGATT CCACAAAA 

GATGATAACTCCAACCCAGGAGGAAAGACAGATGAACCCAAAGGAAAAACAGAAGCCTATTTGGAAGC 
CATCAGAAAAAATATTGAATGGTTGAAGAAACATGACAAAAAGGGAAATAAAGAAGATTATGACCTTT 
CAAAGATGAGAGACTTCATCAATAAACAAGCTGATGCTTATGTGGAGAAAGGCATCCTTGACAAGGAA 
GAAGCCGAGGCCATCAAGCGCATTTATAGCAGCCTGTAAAAATGGCAAAAGATCCAGGAGTCTTTCAA 
CTGTTTCAGAAAACATAATATAGCTTAAAACACTTCTAATTCTGTGATTAAAATTTTTTGACCCAAGG 
GTTAT TAGAAAGTGCT GAAT TTACAGTAGTTAACCT T TTACAAGT GGTTAAAACATAGCT TTCTT CCC 
GTAAAAACTATCTGAAAGTAAAGTTGTATGTAAGCTGAAAAAAAAAAAAAAAAAAAA 



APP ID- 



10063732 



Page 303 of 322 



FIGURE ISO 



MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEAESDKIKKTYPPENKPG 
QSNYSFVDNLNLLKAITEKEKIEKERQSIRSSPLDNKLNVEDVDSTKNRKLIDDYDSTKSGLDHK 
FQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVSKLLNLGLITESQAHTLEDEVAE 
VLQKLISKEANNYEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNTLTLTNGLE 
RRTKT YSEDNFEELQYFPNFYALLKS IDSEKEAKEKE TL I T IMKTLI DFVKMMVKYGT I S PEEGV 
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKTDEPKGKTEAYLEAIRKNIEWLKKHDKKGMKEDYDLSKMRDFINKQADAYVEKGILDK 
EEAEAIKRIYSSL 




N-glycosylation sites: 



amino acids 68-71, 346-349, 350-353 



Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217- 

220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 
341, 369-372, 382-385, 386-389, 387-390 




N-myristoylation sites: 

amino acids 143-148, 239-244 
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FIGURE 151 



CGGCTCGAGGCTCCCGCCAGGAGAAAGGAACATTCTGAGGGGAGTCTACACCCTGTGGAGCTCAA 
GATGGTCCTGAGTGGGGCGCTGTGGTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
ATAAT AACCAGC TT C TAGC TGGAGGGC TGCAT GCAGGGAAGGT CATTAAAGGTGAAGAGATCAGC 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAGTGCCTGTCATGTGGGGTGGGGCAGGAGCCGACTCTAACACTAGAGCCAGTGAACATCATGG 
AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
T G T C AGAC T C AC CC AGC T T C C C G AGAAT G G T GGC T GGAAT GCC C C CAT CACAGAC T T C TAC T TC C 
AGCAGTGTGACTAGGGCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
GAGTGGAGGAGACCCATGGCGGACAATCACTCTCTCTGCTCTCAGGACCCCCACGTCTGACTTAG 
TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT 
CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA 
CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 
H TAATGGTAACTGACAAGTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTTGAGCCTTATA 
fp GGGTCAGTAGCTCTCCACATGAAGTCCTGTCACTCACCACTGTGCAGGAGAGGGAGGTGGTCATA 
£\ GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 
JjjJ TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA 
J*; GAAAT GGC TCGAGCT CAGAAGAT AAAAGATAAGTAGGGTAT GC TGAT C CTCT TT TAAAAACCCAA 

W GAT ACAAT CAAAAT CC CAGATGC TGGT CT CT AT T CCCAT GAAAAAGT GCT CATGACATAT TGAGA 

%l AGACCTACTTACAAAGTGGCATATATT GCAATT TATT TTAATTA7VAAGATACCTATTTATATAT T 

TCTTTATAGAAAAAAGTCTGGAAGAGTTTACTTCAATTGTAGCAATGTCAGGGTGGTGGCAGTAT 
AGGTGATTTTTCTTTTAATTCTGTTAATTTATCTGTATTTCCTAATTTTTCTACAATGAAGATGA 
AT TCCTTG TATAAAAATAAGAAAAGAAAT TAAT CT TGAGGTAAGC AGAGCAGACATCATCTC TGA 
* TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
0 TT GT AGTAGT GATCAGGAAACAGAT CT CAGCAAAGCCAC TGAGGAGGAGGCTGTGC TGAGT TTGT 

m GTGGCTGGAATCTCTGGGTAAGGAACTTAAAGAACAAAAATCATCTGGTAATTCTTTCCTAGAAG 
^ GATCACAGCCCCTGGGATTCCAAGGCATTGGATCCAGTCTCTAAGAAGGCTGCTGTACTGGTTGA 

ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG 
81 GTCT C T GCAGAT GTAGT TAGT TAAGACAAGGT CATGC TGGATGAAGGTAGAC C TAAAT TCAATAT 

C| GACTGGTTTCCTTGTATGAAAAGGAGAGGACACAGAGACAGAGGAGACGCGGGGAAGACTATGTA 
Ifl \ AAGATGAAGGCAGAGATCGGAGTTTTGCAGCCACAAGCTAAGAAACACCAAGGATTGTGGCAACC 

AT CAGAAGC T T GGAAGAGGCAAAGAAGAAT T C T T CCC TAGAGGC T TT AGAGGGATAACGGC TCT G 
CTGAAACCTTAATCTCAGACTTCCAGCCTCCTGAACGAAGAAAGAATAAATTTCGGCTGTTTTAA 
GCCACCAAGGATAAT TGGT TACAGCAGCT CTAGGAAACTAATACAGCTGCTAAAATGATCCC TGT 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
T AGAAT AT GGCAGAAGTGATGGCAT GC CACT TCCAAGAT TAGGT TATAAAAGACAC TGCAGC TT C 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGCTAGCTGCCATGCTATGAGCAGGCCTATAAAGAGACTTACGTGGTAAAAAATGAAGTCTCCT 
GCCCACAGCCACATTAGTGAACCTAGAAGCAGAGACTCTGTGAGATAATCGATGTTTGTTGTTTT 
AAGT TGCTCAGT TT TGGTC TAACTTGT TATGCAGCAATAGATAAATAATATGCAGAGAAAGAG 
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MVLSGALCFRMKDSALKVLYLHNNQLIAGGLHAGKVIKGEEISWPNRWLDASLSPVILGVQGGS 
QCLSCGVGQEPTLTLEPVNIMELYLGAKESKSFTFYRRDMGLTSSFSSAAYPGWFLCTVPEADQP 
VRLTQLPENGGWNAPITDFYFQQCD 

N-myristoylation sites. 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 
Interleukin-1 signature. 



amino acids 111-131 




Interleukin-1 proteins . 

amino acids 8-29, 83-120, 95-134, 64-103 
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FIGURE 153 



CTTCAGAACAGGTTCTCCTTCCCCAGTCACCAGTTGCTCGAGTTAGAATTGTCTGCAATGGCCGC 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
T TCCAGCAGCCCTATATCACCAACCGCACCT TCATGC TGGC TAAGGAGGCTAGC T TGGCTGATAA 
CAACACAGACG T TC GT C TCAT T GGGGAGAAACT GT TCCACGGAGT C AGTAT GAG T GAGCGC T GCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
TAT T GAAGG TGATGACC TGCATATCCAGAGGAATGT GCAAAAGC T GAAGGACACAGTGAAAAAGC 
H T TGGAGAGAGTGGAGAGATCAAAGCAATT GGAGAACTGGAT TT GCTGT TTATGTCTC TGAGAAAT 

W . G C C T G CAT T TGACCAGAGC AAAG C T G AAAAAT GAATAAC TAAC CCCCTTTCCCTGC T AGAAAT AA 



CAAT T AGAT GCCCCAAAGC GAT T T T T T T T AACCAAAAGGAAGAT GGGAAGCC AAAC T CCAT CAT G 
ATGGGTGGATTCCAAATGAACCCCTGCGTTAGTTACAAAGGAAACCAATGCCACTTTTGTTTATA 



Ui 

%l AGACCAGAAGG TAGAC TT T C TAAGCATAGAT AT T TAT TGATAACAT T T CAT T GTAAC T GGTGT T C 

Ul T AT ACAC AGAAAACAAT T T ATT T T TT AAAT AAT TGT.C T T T T T CCAT AAAAAAGAT TAC T T T CCAT 

W TCCTTTAGGGGAAAAAACCCCTAAATAGCTTCATGTTTCCATAATCAGTACTTTATATTTATAAA 

^ T GT AT TT AT TAT TAT TATAAGACT GCAT T T TAT T TAT ATCATT T TAT TAATATGGAT T TAT T TAT 

I m AGAAACATCAT TCGATAT TGCT AC TT GAG T GTAAGGC TAATAT T GAT AT T T ATGACAATAAT TAT 



AGAGCTATAACATGT T TATT TGACCT CAATAAACACT TGGATATCCC 
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FIGURE 154 



MAALQKS VS S FLMGTLATS CLLLLALLVQGGAAAP I S SHCRLDKSNFQQPY I TNRT FMLAKEAS L 
ADNNTDVRLIGEKLFHGVSMSERCYLMKQVLNFTLEEVLFPQSDRFQPYMQEWPFLARLSNRLS 
TCHIEGDDLHIQRNVQKLKDTVKKLGESGEIKAIGELDLLFMSLRNACI 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

N-glycosylation sites. 




N-myristoylation sites. 



amino acids 54-58, 68-72, 97-101 



amino acids 14-20, 82-88 



Prokaryotic membrane lipoprotein lipid attachment site. 



amino acids 10-21 



%4 
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FIGURE 155 



GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGT CAG T G C C C GAC T TG T GAC T GAG T G T GC AG T GC C C AGCAT G T AC C AG G T C AG T GC AGAGGGC 
TGCCTGAGGGCTGTGCTGAGAGGGAGAGGAGCAGAGATGCTGCTGAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGACCCAGATTAGGTGAGGACAGTTCTCTCATTAGCCTTTTCCTACAGGTGGTTGCAT 
TCTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 
GC CCAACC GCCACCCAGAGTCC T GTAGGGC CAGT GAAGA'T GGACCCC TC AACAGCAGGGCCATCT 
y ? CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 
p TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 
M GCTGCTCTACCACAACCAGACTGTCTTCTACAGGCGGCCATGCCATGGCGAGAAGGGCACCCACA 

m 

M AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGT-GCGGCCCCGT 

m 

?<y GTGATGGGCTAGCCGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 
yj ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG 
Pi! CAGGATCCCGGGACAGGATGGGGGGCTTTGGGGAAAACCTGCACTTCTGCACATTTTGAAAAGAG 

CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 
GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 
GCCCAGCACAGGCAC T T TC TAGATAT T TCC CCC T TGC TGGAGAAGAAAGAGC CC C TGG T TT TATT 
p;i TGTTTGTTTACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT 
CI TTGACATGGATGATTCTGAGGAGGAAGCTGTTATTGAATGTATAGAGATTTATCCAAATAAATAT 
VA CTTTATTTAAAAATGAAAAA 



Ml 
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FIGURE 156 



MRERPRLGEDSSLISLFLQWAFLAMVMGTHTYSHWPSCCPSKGQDTSEELLRWSTVPVPPLEPA 
RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 

Important features of the protein: 
Signal peptide: 

amino acids 1-32 




N-glycosylation site. 



amino acids 136-140 



Tyrosine kinase phosphorylation site. 



amino acids 127-135 




111 



N-myristoylation sites. 

amino acids 44-50, 150-156 
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FIGURE 157 



CCGGCGATGTCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 
C C GGAGAC T T GAGGGAC C T CCGAG T AGAACC T G T TAC AAC TAG T GT T GCAACAGGGGAC TAT TCA 
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGGCACCAA 
GATT TGTG TGAC GGGCAAAAGCAAC T T CCAGT C C TACAGC TGTGT GAGGT GCAAT TACACAGAGG 
CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 
GAGC TGAACACAGT C TAT T TCAT TGGGGC CCATAATAT TCC TAAT GCAAATATGAAT GAAGATGG 
CCCTTCCATGTCTGTGAATTTCACCTCACCAGGCTGCCTAGACCACATAATGAAATATAAAAAAA 
AGTGTGTCAAGGCCGGAAGCCTGTGGGATCCGAACATCACTGCTTGTAAGAAGAATGAGGAGACA 
GTAGAAGTGAAC TTCACAACCACTCCCCTGGGAAACAGATACAT GGC TCTTATCCAACACAGCAC 



m TAT CATCGGGT T TT C TCAGGTGT TT GAGCCACACCAGAAGAAACAAACGC GAGC T TCAGTGGTGA 



fl! TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 

fal AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 

, * GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 

Hi CATGGGTGCTGGTGGCAGGGATCTATCTAATGTGGAGGCACGAAAGGATCAAGAAGACTTCCTTT 

s TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA 

O TCACACAATTTGTTACTTCACTGAATTTCTTCAAAACCATTGCAGAAGTGAGGTCATCCTTGAAA 

111 AGTGGCAGAAAAAGAAAATAGCAGAGATGGGTCCAGTGCAGTGGCTTGCCACTGAAAAGAAGGCA 

GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 



GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 
G TGAT CT AAGAAGCCAGAT TCAT CT GCACAAATACG TGGTGGT C TAC T TT AGAGAGAT T GATACA 



AAAGACGAT TACAAT GCT CTCAGTGT CT GCCCCAAG TAC CACC TCAT GAAG GAT GCCAC TGCT T T 
C TGT GCAGAAC T TC TCCAT GTCAAGCAGCAGGT GTCAGCAGGAAAAAGAT CACAAGCC T GC CACG 
ATGGCTGCTGCTCCTTGTAG 
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FIGURE is8 



MSLVLLSLAALCRSAVPREPTVQCGSETGPSPEWMLQHDLIPGDLRDLRVEPVTTSVATGDYSILMNVSWV- 
LRADASIRLLKATKICVTGKSNFQSYSCVRCNYTEAFQTQTRPSGGKWTFSYIGFPVELNTVYFIGAHNIP 
NANMNE DG P SMS VN FTSPGCLDH IMKY KKKC VKAGS LW D PN I T AC KKN E E T VE VN FT TT P LGNR YMAL I Q H 
STIIGFSQVFEPHQECKQTRASVVIPVTGDSEGATVQLTPYFPTCGSDCIRHKGTWLCPQTGVPFPLDNNK 
SKPGGWLPLLLLSLLVATWVLVAGIYLMWRHERIKKTSFSTTTLLPPIKVLWYPSEICFHHTICYFTEFL 
QNHCRSEVILEKWQKKKIAEMGPVQWLATQKKAADKWFLLSNDVNSVCDGTCGKSEGSPSENSQDLFPLA 
FNLFCSDLRSQIHLHKYVWYFREIDTKDDYNALSVCPKYHLMKDATAFCAELLHVKQQVSAGKRSQACHD 
GCCSL 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

Transmembrane domain: 



IhJ amino acids 290-309 

■zr.\t 



N-glycosylation sites . 



LI . amino acids 67 - 71, 103 - 107, 156 - 160, 183 - 187, 197 - 201 and 283 



- 287 



cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 228 - 232 and 319 - 323 

Casein kinase II phosphorylation sites. 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 

N-myristoylation site. 

amino acids 116-122 



Amidation site. 

amino acids 488-452 
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FIGURE 15Q 
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AGCCACCAGCGCAACATGACAGTGAAGACCCTGCATGGCCCAGCCATGGTCAAGTACTTGCTGCT 
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGAAAATCCCCAAAGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACCTCCCCCTG 
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 
ACTTGGGCTGCATCAATGCTCAAGGAAAGGAAGACATCTCCATGAATTCCGTTCCCATCCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 
GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCA GTAAG AGGTGCATATCC 
ACTCAGCTGAAGAAG 
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FIGURE 160 

MTWTLHGPAWKYLLLSILGLAFLSEAAARKIPKVGHTFFQKPESCPPVPGGSMKLDIGIINEN 
QRVSMSRNIESRSTSPWNYTVTWDPNRYPSEWQAQGRNLGCINAQGKEDISMNSVPIQQETLW 
RRKHQGCS VS FQLEKVLVT VGCTCVT PVI HHVQ 

Signal sequence: 

amino acids 1-30 

N-glycosylation site. 

amino acids 83-87 

N-myristoylation sites. 

amino acids 106-111, 136-141 
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FIGURE 161 



ACAC T GGCCAAACAAAAACGAAAGCAC TC CG TGC TGGAAGTAGGAGGAGAGT CAGGAC TCCCAGG 
ACAGAGAGTGCACAAACTACCCAGCACAGCCCCCTCCGCCCCCTCTGGAGGCTGAAGAGGGATTC 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAGATGCCTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCCTGAAGATGAGGAAAAGTTTGGAGGAGCAGCTGACTCAGGGGTGGAGGAGCCTAGGAA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
TGCTTCGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA 
GGAAC T CAACCACACACAGCAGC T GCCTGCCC T GCC C TGGC TCAACG T GTCAGCAGAT GGTGACA 
ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA 
13 CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 

CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
GTGCTGGCGGGCTCCGGGTGGGGACCCeTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 
TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 
U AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
|| CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 

GT GG C T GTACT TCACTACCCAGCAAAGCC T CCACGAGGGCAGCT CGCC TT GGAGAG TAC T TAC TA 
CAAGACCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGATGACTTGGGAGCGCTATGGGCCTG 
CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
11 CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 

CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 
CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGC GCGTGG 
% CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 
y, CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 
111 GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 

GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTGTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGGCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
CGGGACTTAAATAAAGGCAGACGCTGTTTTTCTAAAAAAA 



W 
SI 



CI 
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FIGURE 162 



MPVPWFLLSLALGRSPWLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVLAPTHLQTELV 
LRCQKETDCDLCLRVAVHLAVHGHWEEPEDESKFGGAADSGVEEPRNASLQAQWLSFQAYPTARCVLLEV 
QVPAALVQFGQSVGSVVYDCFEAALGSEVRIWSYTQPRYEKELNHTQQLPALPWLNVSADGDNVHLVLNVS 
EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQN 
LWQAARLRLLTLQSWLLDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWENVTVDKVLEFPLLKGHPNLCVQ 
VNSSEKLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYLLQDLQS 
GQCLQLWDDDLGALWACPMDKYIHKRWALVWLACLLFAAALSLILLLKKDHAKGWLRLLKQDVRSGAAARG 
RAALLLYSADDSGFERLVGALASALCQLPLRVAVDLWSRRELSAQGPVAWFHAQRRQTLQEGGVWLLFSP 
GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 
LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT 

? " 



m 

hi 
H 

w 

f'll j 



Signal sequence: 

amino acids 1-20 

Transmembrane domain. 

amino acids 453-475 



ret N-glycosylation sites. 

||i amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 

334-337, 357-360, 391-394 

C| Glycosaminoglycan attachment site. 

SSI ■. ■ 

il^l ammo acids 583-586 

cAMP- and cGMP -dependent protein kinase phosphorylation site. 

amino acids 552-555 

N-myristoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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FIGURE 163 



GGGAGGGCTCTGTGCCAGCCCCGATGAGGACGCTGCTGACCATCTTGACTGTGGGATCCCTGGCT 
GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 
AAACATCC TGACG TGGGACAGCGGGC CAGAGGGCACCCCAGACAC GG TC TACAGCATC GAG TATA 
AGACGTACGGAGAGAGGGACTGGGTGGCAAAGAAGGGCTGTCAGCGGATCACCCGGAAGTCCTGC 
AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 
GT CAGTGC GGGAGGCCGGT CAGCCACCAAGATGACT GACAGGT T CAGC T CT C TGCAGCACAC TAC 
CCTCAAGCCACCTGATGTGACCTGTATCTCCAAAGTGAGATCGATTCAGATGATTGTTCATCCTA 
CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 
TTCTACCACTTAGAGCTCCAGGTCAACCGCACCTACCAAATGCACCTTGGAGGGAAGCAGAGAGA 
ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 
CCTGGGCCAAGGAGAGTGCCCCCTACATGTGCCGAGTGAAGACACTGCCAGACCGGACATGGACC 
TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 
CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 
TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 



TO AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 

fc:s if 

p% TCCACAGCGGCATAGCCTGTCCGAGATCACCTACTTAGGGCAGCCAGACATCTCCATCCTCCAGC 
g CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 
CP GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 
|lf GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 
\l CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 

CACCTTAGGCCTAAAGGTCAGCTTCAGAAAGAGCCACCAGCTGGAAGCTGCATGTTAGGTGGCCT 
TTCTCTGCAGGAGGTGACGTCCTTGGCTATGGAGGAATCCCAAGAAGCAAAATCATTGCACCAGC 
R| C CCTGGGGAT T T GCACAGACAGAACAT CT GACCCAAATGT GCT ACACAG TG GGGAGGAAGGGACA 

g- CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 

CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 
TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 
GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG 
CI AGGGGAATGGGAAAGGGTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 
PI ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 

AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 



AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 
CAAGGCAGAAATGACAGTGCAAGGAGGAAATGCAGGGAAACTCCCGAGGTCCAGAGCCCCACCTC 
CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 
ACAAT C TAGC TCGACAGAGCAT GAGGCCCCT GCC TC T TCT G TCAT TG T T CAAAGGTGGGAAGAGA 
GCCTGGAAAAGAACCAGGCCTGGAAAAGAACCAGAAGGAGGCTGGGCAGAACCAGAACAACCTGC 
ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 
TTCCCAGCCAGGGCAACTGCCTGACGTTGCACGATTTCAGCTTCATTCCTCTGATAGAACAAAGC 
GAAATGCAGGTCCACCAGGGAGGGAGACACACAAGCCTTTTCTGCAGGCAGGAGTTTCAGACCCT 
ATCCTGAGAATGGGGTTTGAAAGGAAGGTGAGGGCTGTGGCCCCTGGACGGGTACAATAACACAC 
TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAATTCCAGC 
CTCACCACTCACAAGCTGTGTGACTTCAAACAAATGAAATCAGTGCCCAGAACCTCGGTTTCCTC 
AT CT GTAAT GT GGGGATCATAACACC TAC C T CATGGAG T T GTGGT GAAGAT GAAAT GAAGTCATG 
TCTTTT^AAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAAA 
AAAAAAAA 



APP_ID= 
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FIGURE i6a 



MRTLLTILTVGSLAAHAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVYSIEYKTYGERDW 
VAKKGCQRITRKSCNLTVETGNLTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDVTCIS 
KVRS I QMI VHPT PTP IRAGDGHRLTLEDI FHDLFYHLELQVNRTYQMHLGGKQREYE FFGLTPDT 
EFLGTIMICVPTWAKESAPYMCRVKTLPDRTWTYSFSGAFLFSMGFLVAVLCYLSYRYVTKPPAP 
PNSLNVQRVLTFQPLRFIQEHVLIPVFDLSGPSSLAQPVQYSQIRVSGPREPAGAPQRHSLSEIT 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISPCVQPSSY 
APQATPDSWPPSYGVCMEGSGKDSPTGTLSSPKHLRPKGQLQKEPPAGSCMLGGLSLQEVTSLAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 




Transmembrane domain. 



Signal sequence . 



amino acids 233-250 



SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 



amino acids 1-17 



« 



N-glycosylation sites. 




N-myristoylation sites. 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570 



amino acids 80-83, 87-90, 172-175 



APP_ID=1 0063732 



Page 318 of 322 



FIGURE 16 s 

TGGCCTACTGGAAAAAAAAAAAAAftA^ 

CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAAAGGTGACGATGTATATGTCTACTACAAACTGGCAGGGGGATCCCTTGAACTTTGGGCTGGA 
AG T G T T G AACACAG T T T T G GAT AT T T T C C AAAAGAT T T GAT CAAG G T AC T T C AT AAAT AC AC GG A 
AGAAGAGCTACATAT TCCAGCAGATGAGACAGAC TT TGTCTGCT TTGAAGGAGGAAGAGATGATT 
T T AAT AG T T AT AATG TAGAAGAG C T T T TAG GAT C T T T G GAAC T G GAG GAC T C T G T ACC T GAAGAG 
TCGAAGAAAGCTGAAGAAGTTTCTCAGCACAGAGAGAAATCTCCTGAGGAGTCTCGGGGGCGTGA 
AC T T GACCC T GT GCC TGAGCC CGAGGCAT T CAGAGC T GAT T CAGAGG AT GGAGAAGGTGCT T TCT 
CAGAGAGCACCGAGGGGCT GCAGGGACAGCCCT CAGC TCAGGAGAGC CAC CC TCACACCAGCGGT 
CCTGCGGCTAACGCTCAGGGAGTGCAGTCTTCGTTGGACACTTTTGAAGAAATTCTGCACGATAA 
ATTGAAAGTGCCGGGAAGCGAAAGCAGAACTGGCAATAGTTCTCCTGCCTCGGTGGAGCGGGAGA 
AGACAGATGCTTACAAAGTCCTGAAAACAGAAATGAGTCAGAGAGGAAGTGGACAGTGCGTTATT 
CAT TACAGCAAAGGATT TC GTT GGCATCAAAATC TAAGT T T GT T T TACAAAGAT TGTT T T TAGTA 
CTAAGCTGCCTTGGCAGTTTGCATTTTTGAGCCAAACAAAAATATATTATTTTCCCTTCTAAGTA 
AAAAAAAAAAAAAAAAAAAA 
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FIGURE 166 

MAAAPGLL FWL FVLGALWWVPGQS DLSHGRRFS DLKVCGDEECSMLMYRGKALEDFTGPDCRFVN 
FKKGDDVYVYYKLAGGSLELWAGSVEHSFGYFPKDLIKVLHKYTEEELHIPADETDFVCFEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA 
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQS SLDTFEE I LHDKLKVPGSESRTGNS S PASVER 
EKT DAYKVLKTEMS QRGS GQC V I H YS KG FRWHQNL S L FYKDC F 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

N-glycosylation site. 



cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 30-34 



f*i Tyrosine kinase phosphorylation site. 

HI amino acids 67-7 6 



N-myristoylation sites . 

amino acids 205-211, 225-231, 277-283 

Amidation site. 

amino acids 28-32 
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FIGURE 167 



CCAGGACCAGGGCGCACCGGCTCAGCCTCTCAGTTGTCAGAGGCCGGGGAAGAGAAGGAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCGGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGATGAGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGC^CTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
GCCTTGAGAATGAAGCAGAACAGAAGTTAATAGAGAGCATGTTGCAAAACCTGACAAAACCCGGG 
ACAGGGATTTCTGATGGTGATTTCTGGATAGGGCTTTGGAGGAATGGAGATGGGCAAACATCTGG 
TGCCTGCCCAGATCTCTACCAGTGGTCTGATGGAAGCAATTCCCAGTACCGAAACTGGTACACAG 
ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC 
C T T GGGGG T CCC T AC C T T T AC CAGT GGAAT GATGACAG G T GT AACAT GAAG CACAAT TAT AT T TG 

CAAGTATGAACCAGAGATTAATCCAACAGCCCCTGTAGAAAAGCCTTATCTTACAAATCAACCAG 

GAGACACCCATCAGAATGTGGTTGTTACTGAAGCAGGTATAATTCCCAATCTAATTTATGTTGTT 

ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 

GCATAAAAGTAAAGGAAGAACAAAAACTAGTCCAAACCAGTCTACACTGTGGATTTCAAAGAGTA 

CCAGAAAAGAAAGTGGCATGGAAGT ATAA TAACTCATTGACTTGGTTCCAGAATTTTGTAATTCT 

GGATCTGTATAAGGAATGGCATCAGAACAATAGCTTGGAATGGCTTGAAATCACAAAGGATCTGC 

AAGATGAACTGTAAGCTCCCCCTTGAGGCAAATATTAAAGTAATTTTTATATGTCTATTATTTCA 

TTTAAAGAATATGCTGTGCTAATAATGGAGTGAGACATGCTTATTTTGCTAAAGGATGCACCCAA 

ACTTCAAACTTCAAGCAAATGAAATGGACAATGCAGATAAAGTTGTTATCAACACGTCGGGAGTA 

TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 

TGTATATTGTATTGAAATTTACAGTGTGCAAAAGTATTTTACCTTTGCATAAGTGTTTGATAAAA 

ATGAACTGTTCTAATATTTATTTTTATGGCATCTCATTTTTCAATACATGCTCTTTTGATTAAAG 

AAACTTATTACTGTTGTCAACTGAATTCACACACACACAAATATAGTACCATAGAAAAAGTTTGT 

TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 

AATAAGAAGCTATTTCATTAAGTGTGATATAZ\ACCTCCTCAAACATTTTACTTAGAGGCAAGGAT 

TGTCTAATTTCAATTGTGCAAGACATGTGCCTTATAATTATTTTTAGCTTAAAATTAAACAGATT 

T T G T AATAAT G T AAC T T T G T TAAT AGG T GCATAAACAC T AAT GC AG T C AAT T T GAAC AAAAGAAG 

TGACATACACAATATAAATCATATGTCTTCACACGTTGCCTATATAATGAGAAGCAGCTCTCTGA 

GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGG 

ATTGA.CACTGGAGGCAGATAGTTGCAAAGTTAGTCTAAGGTTTCCCTAGCTGTATTTAGCCTCTG 

ACTATATTAGTATACAAAGAGGTCATGTGGTTGAGACCAGGTGAATAGTCACTATCAGTGTGGAG 

ACAAGCACAGCACACAGACATTTTAGGAAGGAAAGGAACTACGAAATCGTGTGAAAATGGGTTGG 

AACCCATCAGTGATCGCATATTCATTGATGAGGGTTTGCTTGAGATAGAAAATGGTGGCTCCTTT 

CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 

CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA7VAAAA 
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FIGURE 168 



MSRWSLLLGAALLCGHGAFCRRWSGQKVCFADFKHPCYKMAYFHELSSRVSFQEARLACESE 

GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSNSQ 

YRNWYTDEPSCGSEKCVVMYHQPTANPGLGGPYLYQWNDDRCNMKHNYICKYEPEINPTAPVEK 

PYLTNQPGDTHQNVWTEAGIIPNLIYWIPTIPLLLLILVAFGTCCFQMLHKSKGRTKTSPNQ 
S TLWI SKS TRKE S GMEV 

Important features of the prdtein: 
Signal peptide: 

amino acids 1-21 

Transmembrane domain : 

amino acids 214-235 

N-glycosylation sites . 

amino acids 86-89, 255-258 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 266-2 69 

N-myristoylation sites . 

amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140- 
145, 212-217 
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